Eukaryotic cells comprising adenovirus-associated virus polynucleotides

ABSTRACT

The present inventions provide eukaryotic cells, such as mammalian cells, that comprise adeno-associated virus (AAV) polynucleotides, including AAV capsid proteins (Cap), and are capable of expressing the polypeptides encoded by the AAV polynucleotides, and thereby are capable of producing AAV, including recombinant AAV. The eukaryotic cells also may comprise adenovirus (Ad) polynucleotides. The present inventions also provide methods of expressing AAV polynucleotides, as well as Ad polynucleotides, in eukaryotic cells, such as CHO cells, HEK 293 and BHK cells. The present inventions further provide other products and methods described herein.

This application claims priority to U.S. Application Serial No. 63/256,730, filed Oct. 18, 2021, which is hereby incorporated by reference in its entirety.

FIELD OF THE INVENTIONS

The present inventions provide eukaryotic cells that comprise adeno-associated virus (AAV) polynucleotides, including AAV capsid proteins (Cap). The cells are capable of expressing the polypeptides encoded by the AAV polynucleotides, and thereby are capable of producing AAV, including recombinant AAV. The eukaryotic cells also may comprise adenovirus (Ad) polynucleotides. The present inventions also provide methods of expressing AAV polynucleotides, as well as Ad polynucleotides, in eukaryotic cells. The present inventions also provide methods for producing recombinant adeno-associated virus utilizing eukaryotic cells that express AAV and Ad polypeptides encoded by polynucleotides, as well as recombinant AAV produced by these inventive methods. The present inventions further provide other products and methods described herein.

REFERENCE TO ELECTRONIC SEQUENCE LISTING

The application contains a Sequence Listing, which has been submitted electronically in .XML format and is hereby incorporated by reference in its entirety. Said .XML copy, created on Oct. 5, 2022, is named “135975-61702.xml” and is 229,879 bytes in size. The sequence listing contained in this .XML file is part of the specification and is hereby incorporated by reference herein in its entirety.

BACKGROUND OF THE INVENTIONS

Adeno-associated virus (AAV) is a non-enveloped, single-stranded DNA virus and is used as a gene delivery vector for both research and therapeutics. Weitzman and Linden, Adeno-Associated Virus Biology (chapter 1), Meth. Molec. Biol. 807: 1-23 (2011). Gene transfer vectors based on AAV have demonstrated promise for human gene therapy based on their safety profile and potential to achieve long-term efficacy in animal models. Wang et al., Nature Reviews Drug Discovery, 18: 358-78 (2019). A major challenge for advancing AAV-based therapies into clinical development is the difficulty and cost of producing sufficient quantities of AAV through transient methodologies.

AAV has been produced in HEK 293, BHK, human amniotic (for example, epithelial cells such as HAEpiC) and SF9 lines. However, expression is transient due to the use of plasmid vectors containing the necessary AAV and helper virus genes. For example, recombinant AAV production in HEK 293 cells using adenovirus helper gene products utilizes adenovirus E2A, E4, VA RNA and AAV Rep and Cap, along with AAV inverted terminal repeats (ITR) flanking the polynucleotide of interest. The reliance on non-integrated plasmid vectors means that the requisite gene products will be lost over time and need to be continually reestablished.

The AAV genome includes a capsid gene referred to as “Cap” or “CAP”. Cap in nature is translated to produce, via alternative start codons and transcript splicing, three size-variant structural proteins referred to as VP1 (about 90 kDa), VP2 (about 72 kDa) and VP3 (about 60 kDa). An AAV capsid contains 60 subunits total of the VP proteins. A ratio of 1:1:10 is considered the most typical ratio for VP1:VP2:VP3, which is a stoichiometry of 5 VP1 subunits:5 VP2 subunits:50 VP3 subunits. However, there can be variation. Wörner et al., Nature Communications 12:1642 (2021). AAV polynucleotides and proteins, including CAP, can be selected from any serotype.
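By way of illustration only, the arithmetic relating the 1:1:10 ratio to the 5:5:50 stoichiometry of a 60-subunit capsid can be sketched as follows. The function name and its parameters are hypothetical and are introduced solely for this example; the sketch does not model the variation noted above.

```python
def vp_subunit_counts(ratio=(1, 1, 10), total_subunits=60):
    """Split the capsid subunits among VP1, VP2 and VP3 according to a ratio.

    `ratio` and `total_subunits` default to the typical values stated in the
    text (1:1:10 and 60 subunits); both names are illustrative assumptions.
    """
    parts = sum(ratio)  # 1 + 1 + 10 = 12 ratio parts in the typical case
    return tuple(total_subunits * r / parts for r in ratio)


vp1, vp2, vp3 = vp_subunit_counts()
print(vp1, vp2, vp3)  # 5.0 5.0 50.0
```

Each ratio part corresponds to 60/12 = 5 subunits, which gives 5 VP1, 5 VP2 and 50 VP3 subunits.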

Thus, there exists the need to develop improved cells and production methods that avoid the transient nature of non-integrated plasmid vectors.

SUMMARY OF THE INVENTIONS

The present inventions provide stable eukaryotic cells, such as mammalian cells (for example, primate, rodent and canine cells), comprising integrated AAV polynucleotides and Ad polynucleotides. The protein and VA RNA products of the Ad polynucleotides act in a helper capacity. All AAV and Ad types are amenable for use according to the present inventions. The present inventions advantageously can employ site-specific integration into the cell genome, which refers to pre-selected genomic sites for exogenous DNA to be inserted into a cellular genome. Random insertion can be employed as well.

Herein described are polynucleotides, where each can comprise (i) a promoter, (ii) an intron, (iii) an internal ribosome entry site, (iv) a polynucleotide encoding adeno-associated virus (AAV) Cap protein, and (v) a polyadenylation site. For example, the polynucleotide can be in a CHO cell and have the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein operably linked. The polynucleotide can be integrated into a CHO cell genome, such as a CHO chromosome. Alternatively, the polynucleotide can be in a HEK 293 cell and have the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein operably linked. The polynucleotide can be integrated into a HEK 293 cell genome, such as a HEK 293 cell chromosome. In another alternative, the polynucleotide can be in a BHK cell and have the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein operably linked. The polynucleotide can be integrated into a BHK cell genome, such as a BHK cell chromosome. In yet another alternative, the polynucleotide can be in a human amniotic cell and have the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein operably linked. The polynucleotide can be integrated into a human amniotic cell genome, such as a human amniotic cell chromosome. Additionally, the polynucleotide can be integrated into non-chromosomal locations as known by the person skilled in the art, such as episomes.

The polynucleotide can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator. AAV proteins and polynucleotides, including CAP, can be selected from any serotype. When Cap is from serotype 5 (“Cap5”) and expressed in a CHO cell, the polynucleotide allows for production of AAV Cap5 VP2 and VP3 protein, wherein the amount of VP3 produced is greater than the amount of VP2 produced. Production of VP1 can be less than 1% of the level of VP2 production. Production ratios can vary based upon experimental conditions and analytical techniques.

Additionally, polynucleotides are described, wherein each can comprise (i) a promoter, (ii) an intron, (iii) a first internal ribosome entry site, (iv) a first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, (vi) a second polynucleotide encoding AAV Cap protein, and (vii) a polyadenylation site. For example, the polynucleotide can be in a CHO cell and the (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a CHO cell genome. Alternatively, the polynucleotide can be in a HEK 293 cell and the (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein are operably linked. The polynucleotide can be integrated into a HEK 293 cell genome. In another alternative, the polynucleotide can be in a BHK cell and the (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein are operably linked. The polynucleotide can be integrated into a BHK cell genome. In another alternative, the polynucleotide can be in a human amniotic cell and the (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein are operably linked. The polynucleotide can be integrated into a human amniotic cell genome. The polynucleotide can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator. AAV proteins and polynucleotides, including CAP, can be selected from any serotype. When Cap is from serotype 5 (“Cap5”) and expressed in a CHO cell, the polynucleotide allows for production of AAV Cap5 proteins VP1, VP2 and VP3. The amount of VP3 production can be greater than the amount of VP1 production and the amount of VP2 production. The amount of VP2 production can be greater than the amount of VP1 production. Production ratios can vary based upon experimental conditions and analytical techniques.

Moreover, there are described polynucleotides, wherein each polynucleotide can comprise (i) a promoter, (ii) an intron, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site, wherein the polynucleotide allows for production of AAV Cap VP1 protein when expressed. For example, the polynucleotide can be in a CHO cell and the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a CHO cell genome. Alternatively, the polynucleotide can be in a HEK 293 cell and the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a HEK 293 cell genome. In another alternative, the polynucleotide can be in a BHK cell and the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a BHK cell genome. In still another alternative, the polynucleotide can be in a human amniotic cell and the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a human amniotic cell genome. The polynucleotide can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator.

Furthermore, there are described polynucleotides, wherein each can comprise (i) a promoter, (ii) an internal ribosome binding site, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site, wherein the polynucleotide allows for production of AAV VP1 Cap protein when expressed. The polynucleotide can be in a CHO cell and the (i) promoter, (ii) internal ribosome binding site, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a CHO genome. Alternatively, the polynucleotide can be in a HEK 293 cell and the (i) promoter, (ii) internal ribosome binding site, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a HEK 293 cell genome. In another alternative, the polynucleotide can be in a BHK cell and the (i) promoter, (ii) internal ribosome binding site, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a BHK cell genome. In yet another alternative, the polynucleotide can be in a human amniotic cell and the (i) promoter, (ii) internal ribosome binding site, and (iii) polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a human amniotic cell genome. The polynucleotide can comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator.

Additionally, there are described eukaryotic cells, where each cell can comprise a polynucleotide comprising (i) a promoter, (ii) an intron, (iii) a first internal ribosome entry site, (iv) a first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, (vi) a second polynucleotide encoding AAV Cap protein, and (vii) a polyadenylation site. The (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a cell genome. The cell can be a CHO cell, a HEK 293 cell, a BHK cell, a human amniotic cell or other eukaryotic cell. The cell can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator. The cell can further comprise: a polynucleotide encoding AAV Rep, a polynucleotide encoding Ad E1A, a polynucleotide encoding Ad E1B, a polynucleotide encoding Ad E2A or E2A orf, a polynucleotide encoding Ad E4 or E4 orf 6, a polynucleotide encoding VA RNA, and a polynucleotide encoding AAV ITRs and a protein of interest.

There are also described eukaryotic cells, where each cell can comprise (A) a first polynucleotide comprising (i) a promoter, (ii) an intron, (iii) an internal ribosome entry site, (iv) a polynucleotide encoding AAV Cap protein, and (v) a polyadenylation site; and (B) a second polynucleotide comprising (i) a promoter, (ii) an intron, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site. The (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (A) first polynucleotide can be operably linked, and the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein of (B) second polynucleotide can be operably linked. The cell can have at least one polynucleotide integrated into a cell genome. The cell can be a CHO cell, a HEK 293 cell, a BHK cell, a human amniotic cell or other eukaryotic cell. The cell can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator. The cell can further comprise a polynucleotide encoding AAV Rep, a polynucleotide encoding Ad E1A, a polynucleotide encoding Ad E1B, a polynucleotide encoding Ad E2A or E2A orf, a polynucleotide encoding Ad E4 or E4 orf 6, a polynucleotide encoding VA RNA, and a polynucleotide encoding AAV ITRs and a protein of interest.

Additionally, there are described eukaryotic cells, where each cell can comprise: (A) a first polynucleotide comprising (i) a promoter, (ii) an intron, (iii) an internal ribosome entry site, (iv) a polynucleotide encoding AAV Cap protein, and (v) a polyadenylation site; and (B) a second polynucleotide comprising (i) a promoter, (ii) an internal ribosome entry site, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site. The cells can have the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (A) first polynucleotide operably linked, and the (i) promoter, (ii) internal ribosome entry site, and (iii) polynucleotide encoding AAV Cap protein of (B) second polynucleotide operably linked. At least one polynucleotide can be integrated into a cell genome. The cell can be a CHO cell, a HEK 293 cell, a BHK cell, a human amniotic cell or other eukaryotic cell. The cell can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator. The cell can further comprise: a polynucleotide encoding AAV Rep, a polynucleotide encoding Ad E1A, a polynucleotide encoding Ad E1B, a polynucleotide encoding Ad E2A or E2A orf, a polynucleotide encoding Ad E4 or E4 orf 6, a polynucleotide encoding VA RNA, and a polynucleotide encoding AAV ITRs and a protein of interest.

There also are described cell cultures comprising any of the above cells in any type of media, including growth media and maintenance media. Additionally, there are described methods of producing AAV proteins, including Cap proteins, and methods that can result in the production of recombinant AAV.

There are described methods of producing adeno-associated virus (AAV) Cap protein in cell culture, wherein a method comprises the steps of: providing eukaryotic cells, wherein a cell comprises a polynucleotide comprising (i) a promoter, (ii) an intron, (iii) a first internal ribosome entry site, (iv) a first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, (vi) a second polynucleotide encoding AAV Cap protein, and (vii) a polyadenylation site; and culturing the cells in a culture medium to allow the cells to produce AAV Cap protein, wherein the polynucleotide allows for production of AAV Cap proteins VP1, VP2 and VP3. The (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein can be operably linked. The polynucleotide can be integrated into a cell genome. The cell can be a CHO cell, a HEK 293 cell, a BHK cell, a human amniotic cell or other eukaryotic cell. The cell can further comprise an operator. The cell can further comprise: a polynucleotide encoding AAV Rep, a polynucleotide encoding Ad E1A, a polynucleotide encoding Ad E1B, a polynucleotide encoding Ad E2A or E2A orf, a polynucleotide encoding Ad E4 or E4 orf 6, a polynucleotide encoding VA RNA, and a polynucleotide encoding AAV ITRs and a protein of interest, wherein the cell can produce recombinant AAV.

Also described are methods of producing adeno-associated virus (AAV) Cap protein in cell culture, wherein a method comprises the steps of: providing eukaryotic cells, where a cell comprises (a) a first polynucleotide comprising (i) a promoter, (ii) an intron, (iii) an internal ribosome entry site, (iv) a polynucleotide encoding AAV Cap protein, and (v) a polyadenylation site; and (b) a second polynucleotide comprising (i) a promoter, (ii) an intron, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site; and culturing the cells in a culture medium to allow the cells to produce AAV Cap protein, wherein the polynucleotide allows for production of AAV Cap proteins VP1, VP2 and VP3. The (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (a) first polynucleotide can be operably linked, and the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein of (b) second polynucleotide can be operably linked. The polynucleotide can be integrated into a cell genome. The cell can be a CHO cell, a HEK 293 cell, a BHK cell, a human amniotic cell or other eukaryotic cell. The cell can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator. The cell can further comprise: a polynucleotide encoding AAV Rep, a polynucleotide encoding Ad E1A, a polynucleotide encoding Ad E1B, a polynucleotide encoding Ad E2A or E2A orf, a polynucleotide encoding Ad E4 or E4 orf 6, a polynucleotide encoding VA RNA, and a polynucleotide encoding AAV ITRs and a protein of interest, wherein the cell can produce recombinant AAV.

There are also described methods of producing adeno-associated virus (AAV) Cap protein in cell culture, wherein a method comprises the steps of: providing eukaryotic cells, where a cell comprises (a) a first polynucleotide comprising (i) a promoter, (ii) an intron, (iii) an internal ribosome entry site, (iv) a polynucleotide encoding AAV Cap protein, and (v) a polyadenylation site; and (b) a second polynucleotide comprising (i) a promoter, (ii) an internal ribosome entry site, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site; and culturing the cells in a culture medium to allow the cells to produce AAV Cap protein, wherein the polynucleotide allows for production of AAV Cap proteins VP1, VP2 and VP3. The (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (a) first polynucleotide can be operably linked, and the (i) promoter, (ii) internal ribosome entry site, and (iii) polynucleotide encoding AAV Cap protein of (b) second polynucleotide can be operably linked. The polynucleotide can be integrated into a cell genome. The cell can be a CHO cell, a HEK 293 cell, a BHK cell, a human amniotic cell or other eukaryotic cell. The cell can further comprise an operator. The promoter can be a CMV promoter and the operator can be a Tet operator. The cell can further comprise: a polynucleotide encoding AAV Rep, a polynucleotide encoding Ad E1A, a polynucleotide encoding Ad E1B, a polynucleotide encoding Ad E2A or E2A orf, a polynucleotide encoding Ad E4 or E4 orf 6, a polynucleotide encoding VA RNA, and a polynucleotide encoding AAV ITRs and a protein of interest, wherein the cell can produce recombinant AAV.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 is a schematic diagram of a polynucleotide comprising a promoter, an intron, an internal ribosome entry site (IRES), a polynucleotide encoding AAV Cap protein and a polyadenylation site (poly A).

FIG. 2 is a schematic diagram of a polynucleotide comprising a promoter, an intron, two internal ribosome entry sites, two polynucleotides encoding AAV Cap protein and a polyadenylation site.

FIG. 3 is a schematic diagram of a polynucleotide comprising a promoter, an intron, a polynucleotide encoding AAV Cap protein and a polyadenylation site.

FIG. 4 is a schematic diagram of a polynucleotide comprising a promoter, an internal ribosome entry site, a polynucleotide encoding AAV Cap protein and a polyadenylation site.

FIG. 5 is a schematic diagram of a polynucleotide comprising a promoter with operator (Op), an intron, an internal ribosome entry site, a polynucleotide encoding AAV Cap protein and a polyadenylation site.

FIG. 6 is a schematic diagram of a polynucleotide comprising a promoter with operator (Op), an intron, two internal ribosome entry sites, two polynucleotides encoding AAV Cap protein and a polyadenylation site.

FIG. 7 is a schematic diagram of a polynucleotide comprising a promoter with operator (Op), an intron, a polynucleotide encoding AAV Cap protein and a polyadenylation site.

FIG. 8 is a schematic diagram of a polynucleotide comprising a promoter with operator (Op), an internal ribosome entry site, a polynucleotide encoding AAV Cap protein and a polyadenylation site.
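For orientation only, the element composition of the eight schematic polynucleotides can be tabulated in a short Python sketch. The element labels, the dictionary names and the helper functions are hypothetical and are introduced solely for this illustration. The element order shown follows the order in which the elements are listed in the text and is an assumption about the drawings; the label "promoter+Op" deliberately does not state whether the operator lies upstream or downstream of the promoter.

```python
# Element lists for FIGS. 1-4, in the order the elements are recited in the text.
BASE_LAYOUTS = {
    1: ("promoter", "intron", "IRES", "Cap", "polyA"),
    2: ("promoter", "intron", "IRES", "Cap", "IRES", "Cap", "polyA"),
    3: ("promoter", "intron", "Cap", "polyA"),
    4: ("promoter", "IRES", "Cap", "polyA"),
}


def with_operator(layout):
    """Return the same layout with the promoter replaced by a promoter with operator."""
    return tuple("promoter+Op" if element == "promoter" else element
                 for element in layout)


# FIGS. 5-8 correspond to FIGS. 1-4 with an operator added to the promoter.
FIGURE_LAYOUTS = dict(BASE_LAYOUTS)
for fig, layout in BASE_LAYOUTS.items():
    FIGURE_LAYOUTS[fig + 4] = with_operator(layout)


def describe(fig):
    """Render one figure's element list as a single line of text."""
    return " - ".join(FIGURE_LAYOUTS[fig])


for fig in sorted(FIGURE_LAYOUTS):
    print(f"FIG. {fig}: {describe(fig)}")
```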

DETAILED DESCRIPTION OF THE INVENTIONS

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which these inventions belong.

Definitions

The term “about” in the context of numerical values and ranges refers to values or ranges that approximate or are close to the recited values or ranges such that the inventions can perform, such as having a sought rate, amount, degree, increase, decrease, or extent of expression, concentration, or time, as is apparent from the teachings contained herein. Thus, this term encompasses values beyond those simply resulting from systematic error. For example, “about” can signify values either above or below the stated value in a range of approx. +/- 10% or more or less depending on the ability to perform.
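As a non-limiting illustration of the +/- 10% example, a numerical check can be sketched as follows. The function name and the fixed default tolerance are assumptions made only for this example; as defined above, the applicable range can be more or less than 10% depending on the ability to perform, so the tolerance is left as a parameter.

```python
def is_about(value, stated, tolerance=0.10):
    """Return True if `value` lies within +/- `tolerance` of `stated`.

    The 0.10 default mirrors the approx. +/- 10% example in the text and is
    not a fixed limit; callers may pass a wider or narrower tolerance.
    """
    return abs(value - stated) <= tolerance * abs(stated)


print(is_about(95, 90))        # True: 95 is within 10% of 90
print(is_about(75, 60))        # False: 75 is 25% above 60
print(is_about(75, 60, 0.30))  # True under a wider, caller-chosen tolerance
```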

“Intron” is a section of DNA located between exons. An intron is removed to form a mature messenger RNA. Preferred introns are those that can affect the starting point of translation, and exemplars are the hCMV-IE intron (Human cytomegalovirus immediate early protein) and FMDV intron (Foot and Mouth Disease Virus). The globin gene intron also has been reportedly used for expression.

A “nucleic acid moiety” includes any arrangement of single stranded or double stranded nucleotide sequences. Nucleic acid moieties can include, but are not limited to, polynucleotides, promoters, enhancers, operators, repressors, transcription termination signals, ribosomal entry sites and polyadenylation signals.

A “DNA cassette” or “cassette” is a type of nucleic acid moiety that comprises at least a promoter, at least one open reading frame and optionally a polyadenylation signal, for example an SV40 polyadenylation signal. Other nucleic acid moieties, such as operators, also are optional. A DNA cassette thus is a polynucleotide that comprises two or more shorter polynucleotides.

“Operably linked” refers to one or more nucleotide sequences in functional relationships with one or more other nucleotide sequences. Such functional relationships can directly or indirectly control, cause, regulate, enhance, facilitate, permit, attenuate, repress or block an action or activity in accordance with the selected design. Exemplars include single-stranded or double-stranded nucleic acid moieties, and can comprise two or more nucleotide sequences arranged within a given moiety in such a way that sequence(s) can exert at least one functional effect on other(s). For example, a promoter operably linked to the coding region of a DNA polynucleotide sequence can facilitate transcription of the coding region. Other elements, such as enhancers, operators, repressors, transcription termination signals, ribosomal entry sites and polyadenylation signals also can be operably linked with a polynucleotide of interest to control its expression. Arrangements and spacing to achieve operable linkages can be ascertained by approaches available to the person skilled in the art, such as screening using western blots and RT-PCR.

“Operator” indicates a DNA sequence that is introduced in or near a polynucleotide sequence in such a way that the polynucleotide sequence may be regulated by the interaction of a molecule capable of binding to the operator and, as a result, prevent or allow transcription of the polynucleotide sequence, as the case may be. One skilled in the art will recognize that the operator must be located sufficiently in proximity to the promoter such that it is capable of controlling or influencing transcription by the promoter, which can be considered a type of operable linkage. The operator may be placed either downstream or upstream of the promoter. Operators include, but are not limited to, the operator region of the Lex A gene of E. coli, which binds the Lex A peptide, and the lactose and tryptophan operators, which bind the repressor proteins encoded by the LacI and trpR genes of E. coli. Operators also include the bacteriophage operators from the lambda PR and the phage P22 Mnt and Arc. Preferred operators are the Tet (tetracycline) operator and the Arc operator. Operators can have a native sequence or a mutant sequence. For example, mutant sequences of the Tet operator are disclosed in Wissmann et al., Nucleic Acids Res. 14: 4253-66 (1986).

The phrases “percent identity” or “% identical,” in their various grammatical forms, when describing a sequence are meant to include homologous sequences that display the recited identity along regions of contiguous homology, but the presence of gaps, deletions, or insertions that have no homolog in the compared sequence are not taken into account in calculating percent identity. As used herein, a “percent identity” or “% identical” determination between homologs would not include a comparison of sequences where the homolog has no homologous sequence to compare in an alignment. Thus, “percent identity” and “% identical” do not include penalties for gaps, deletions, and insertions.
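The gap-free calculation described above can be illustrated with a minimal Python sketch. It assumes that the two sequences have already been aligned to equal length by some separate method and that gaps are written as “-”; the function name, the gap symbol and the example sequences are hypothetical and are not taken from the Sequence Listing.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns, skipping any column with a gap.

    Columns in which either sequence has a gap are not counted at all, so
    gaps, deletions and insertions carry no penalty.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # no homologous residue to compare in this column
        compared += 1
        if a.upper() == b.upper():
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * matches / compared


# Five comparable columns, four of them identical; the gapped column is ignored.
print(percent_identity("ATG-CA", "ATGGCT"))  # 80.0
```

Under this convention the example yields 80% identity, whereas counting the gapped column as a mismatch would give a lower value.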

A “homologous sequence” in the context of nucleic acid sequences refers to a sequence that is substantially homologous to a reference nucleic acid sequence. In some embodiments, two sequences are considered to be substantially homologous if at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more of their corresponding nucleotides are identical over a relevant stretch of residues. In some embodiments, the relevant stretch is a complete (i.e., full) sequence.

“Polynucleotide” includes a sequence of nucleotides covalently joined, and includes RNA and DNA. Oligonucleotides are considered shorter polynucleotides. Genes are DNA polynucleotides (polydeoxyribonucleic acid) that ultimately encode polypeptides, which are translated from RNA (polyribonucleic acid) that was typically transcribed from DNA. DNA polynucleotides also can encode RNA polynucleotides that are not translated, but rather function as RNA “products”. The type of polynucleotide (that is, DNA or RNA) is apparent from the context of the usage of the term. A polynucleotide referred to or identified by the polypeptide it encodes sets forth and covers all suitable sequences in accordance with codon degeneracy. Polynucleotides, including those disclosed herein, include percent identity sequences and homologous sequences when indicated.

“Polypeptide” or “peptide” refers to sequence(s) of amino acids covalently joined. Polypeptides include natural, semi-synthetic and synthetic proteins and protein fragments. “Polypeptide” and “protein” can be used interchangeably. Oligopeptides are considered shorter polypeptides.

“Protein of interest” or “polypeptide of interest” can have any amino acid sequence, and includes any protein, polypeptide, or peptide, and derivatives, components, domains, chains and fragments thereof. Included are, but not limited to, viral proteins, bacterial proteins, fungal proteins, plant proteins and animal (including human) proteins. Protein types can include, but are not limited to, antibodies, bi-specific antibodies, multi-specific antibodies, antibody chains (including heavy and light), antibody fragments, Fv fragments, Fc fragments, Fc-containing proteins, Fc-fusion proteins, receptor Fc-fusion proteins, receptors, receptor domains, trap and mini-trap proteins, enzymes, factors, repressors, activators, ligands, reporter proteins, selection proteins, protein hormones, protein toxins, structural proteins, storage proteins, transport proteins, neurotransmitters and contractile proteins. Derivatives, components, chains and fragments of the above also are included. The sequences can be natural, semi-synthetic or synthetic. Proteins of interest and polypeptides of interest are encoded by “genes of interest,” which also can be referred to as “polynucleotides of interest.” Where multiple genes (same or different) are integrated, they can be referred to as “first,” “second,” “third,” “fourth,” “fifth,” “sixth,” “seventh,” “eighth,” “ninth,” “tenth,” etc. as is apparent from the context of use.

“Promoter” indicates a DNA sequence that causes transcription of a DNA sequence to which it is operably linked, i.e., linked in such a way as to permit transcription of the nucleotide sequence of interest when the appropriate signals are present and repressors are absent. The expression of a polynucleotide of interest may be placed under control of any promoter or enhancer element known in the art. A eukaryotic promoter can be operably linked to a TATA Box. The TATA Box is typically located upstream of the transcription start site.

Promoters that may be used include, but are not limited to, the SV40 early promoter region, SV40 E/L (early late) promoter, the promoter contained in the 3′ long terminal repeat of Rous sarcoma virus, the regulatory sequences of the metallothionein gene, mouse or human cytomegalovirus major immediate early (CMV-MIE) promoter and other CMV promoters, including CMVmin promoters; promoters used in plant expression vectors, such as the nopaline synthetase promoter region, the cauliflower mosaic virus 35S RNA promoter, and the promoter of the photosynthetic enzyme ribulose bisphosphate carboxylase; promoter elements from yeast or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerate kinase) promoter, and alkaline phosphatase promoter; and the following animal transcriptional control regions, which exhibit tissue specificity and have been utilized in transgenic animals: elastase I; insulin; immunoglobulin; mouse mammary tumor virus; albumin; α-fetoprotein; α1-antitrypsin; β-globin; and myosin light chain-2. Various forms of the CMV promoter can be used according to the inventions.

Minimal promoters, such as CMVmin promoters, can be truncated promoters or core promoters and are preferred for use in controlled expression systems. Minimal promoters and development approaches are widely known and disclosed in, for example, Saxena et al., Methods Molec. Biol. 1651:263-73 (2017); Ede et al., ACS Synth Biol. 5:395-404 (2016); Brown et al., Biotech Bioeng. 111:1638-47 (2014); Morita et al., Biotechniques 0:1-5 (2012); Lagrange et al., Genes Dev. 12:34-44 (1998). There are many CMVmin promoters described in the field.

“Reporter proteins,” as used herein, refers to any protein capable of generating a detectable signal. Reporter proteins typically fluoresce, or catalyze a colorimetric or fluorescent reaction, and often are referred to as “fluorescent proteins” or “color proteins.” However, a reporter protein also can be non-enzymatic and non-fluorescent as long as it can be detected by another protein or moiety, such as a cell surface protein detected with a fluorescent ligand. A reporter protein also can be an inactive protein that is made functional through interaction with another protein that is fluorescent or catalyzes a reaction. Accordingly, any suitable reporter protein, as understood by one of skill in the art, could be used. In some aspects, the reporter protein may be selected from fluorescent protein, luciferase, alkaline phosphatase, β-galactosidase, β-lactamase, dihydrofolate reductase, ubiquitin, and variants thereof. Fluorescent proteins are useful for the recognition of gene cassettes that have or have not been successfully inserted and/or replaced, as the case may be. Flow cytometry and fluorescence-activated cell sorting are suitable for detection. Examples of fluorescent proteins are well-known in the art, including, but not limited to, Discosoma coral (DsRed), green fluorescent protein (GFP), enhanced green fluorescent protein (eGFP), cyan fluorescent protein (CFP), enhanced cyan fluorescent protein (eCFP), yellow fluorescent protein (YFP), enhanced yellow fluorescent protein (eYFP) and far-red fluorescent protein (e.g., mKate, mKate2, mPlum, mRaspberry or E2-crimson). See, for example, U.S. Pat. No. 9,816,110. Reporter proteins are encoded by polynucleotides, which are referred to herein as “reporter genes” or “reporter protein genes.” Reporters can be considered a type of marker. “Color” or “fluorescent,” in their various grammatical forms, also can be used to refer more specifically to a reporter protein or gene.

A “repressor protein,” also referred to as a “repressor,” is a protein that can bind to DNA in order to repress transcription. Repressors are of eukaryotic and prokaryotic origin. Prokaryotic repressors are preferred. Examples of repressor families include: TetR, LysR, LacI, ArsR, IcIR, MerR, AsnC, MarR, DeoR, GntR and Crp families. Repressor proteins in the TetR family include: ArcR, ActII, AmeR, AmrR, ArpR, BpeR, EnvR, EthR, HemR, HydR, IfeR, LanK, LfrR, LmrA, MtrR, Pip, PqrA, QacR, RifQ, RmrR, SimReg2, SmeT, SrpR, TcmR, TetR, TtgR, TrgW, UrdK, VarR, YdeS, ArpA, BarA, Aur1B, CaIR1, CprB, FarA, JadR*, JadR2, MphB, NonG, PhIF, TyIQ, VanT, TarA, TyIP, BM1P1, Bm3R1, ButR, CampR, CamR, DhaR, KstR, LexA-like, AcnR, PaaRR, Psbl, Th1R, UidR, YDH1, BetI, McbR, MphR, PhaD, Q9ZF45, TtK, Yhgd, YixD, CasR, IcaR, LitR, LuxR, LuxT, OpaR, Orf2, SmcR, HapR, Ef0113, HIyIIR, BarB, ScbR, MmfR, AmtR, PsrA and YjdC proteins. See Ramos et al., Microbiol. Mol. Biol. Rev., 69: 326-56 (2005). Still other repressors include PurR, LacR, MetJ and PadR. Repressor proteins are encoded by genes referred to as “repressor genes” or “repressor protein genes.”

“Selectable” or “selection” marker proteins include proteins conferring certain traits, including but not limited to drug resistance or other selective advantages. Selection markers can give the cell receiving the selectable marker gene resistance towards a certain toxin, drug, antibiotic or other compound and permit the cell to produce protein and propagate in the presence of the toxin, drug, antibiotic or other compound, and are often referred to as “positive selectable markers.” Suitable examples of antibiotic resistance markers include, but are not limited to, proteins that impart resistance to various antibiotics, such as kanamycin, spectinomycin, neomycin, gentamycin (G418), ampicillin, tetracycline, chloramphenicol, puromycin, hygromycin, zeocin, and/or blasticidin. There are other selectable markers, often referred to as “negative selectable markers,” which cause a cell to stop propagating, stop protein production and/or are lethal to the cell in the presence of the negative selectable marker proteins. Thymidine kinase and certain fusion proteins can serve as negative selectable markers, including but not limited to GyrB-PKR. See White et al., Biotechniques, 50: 303-309 (May 2011). Selectable marker proteins and corresponding genes can be referred to generically as first (1), second (2), third (3), fourth (4), fifth (5), sixth (6), seventh (7), eighth (8), ninth (9), tenth (10), etc., as is apparent from the context of usage.

All numerical limits and ranges set forth herein include all numbers or values thereabout or there between of the numbers of the range or limit. The ranges and limits described herein expressly denominate and set forth all integers, decimals and fractional values defined and encompassed by the range or limit.

Description

The inventions provide cells comprising AAV and optionally Ad polynucleotide sequences to allow production of recombinant AAV comprising a polynucleotide of interest, such as a gene or other sequence encoding a polypeptide of interest. The AAV and Ad polynucleotides provide the requisite structural and helper products required for AAV production.

AAV polynucleotides, and optionally Ad polynucleotides, can be integrated using recombinase-mediated cassette exchange (RMCE), for example. “Stable” in the context of cell integration refers to a polynucleotide of interest, such as a gene, that is introduced into the genome of a cell and can be passed to subsequent generations of cells, and thereby can provide cell lines that are genetically homogeneous for a period of time.

Cells that are suitable for use with the inventions can be readily selected by those of skill in the art. In some embodiments the cell line is a eukaryotic cell line such as a yeast cell line, insect cell line (for example, Sf9 and Sf21 cells) or a mammalian cell line. Preferred mammalian cells include primate cells (including human), canine cells and rodent cells. Cells can be primary cells or immortalized cells. Suitable cells can be selected from Vero cells, COS cells, HEK 293 cells, HeLa cells, CHO cells, BHK cells, MDCK cells, amniotic cells (human), embryonic cells, cell lines transfected with viral genes, for example, AD5 E1, including but not limited to an immortalized human retinal cell transfected with an adenovirus gene, for example, a PER.C6 cell, or an NSO cell. In some embodiments, the cell is a Chinese hamster ovary (CHO) cell line. Some examples of CHO cells include, but are not limited to, CHO-ori, CHO-K1, CHO-s, CHO-DHB11, CHO-DXB11, CHO-K1SV, and mutants and variants thereof. In other embodiments, the cell is a HEK293 cell. Some examples of HEK293 cells include, but are not limited to, HEK293, HEK293A, HEK293E, HEK293F, HEK293FT, HEK293FTM, HEK293H, HEK293MSR, HEK293S, HEK293SG, HEK293SGGD, HEK293T and mutants and variants thereof.

For hamster cells such as CHO and BHK, integration can be accomplished by inventions disclosed in U.S. Pat. Nos. 7,771,997 (“Stable Site 1”) and 9,816,110 (“Stable Site 2”), which are hereby incorporated by reference, including sequence information. Regeneron provides a suite of goods and services referred to as EESYR®. CHO cells with integrated sequences in Stable Site 1 and Stable Site 2 are disclosed in US 2019/0233544 A1, which is hereby incorporated by reference, including sequence information. Sequences set forth in these patents and Examples 14 and 15 can be used according to the inventions described and depicted herein. Additionally, an AAVS1-like region and the COSMC locus in hamster cells can be used according to the inventions.

Where human cells are employed, integration into adeno-associated virus integration site 1 (AAVS1) can be undertaken. See Lou et al., Human Gene Therapy Methods, 28: 124-38 (2017); Liu et al., BMC Research Notes, 7:626 (2014). AAVS1 is reported to be located on chromosome 19. Other integration sites in human cells can be used as well, such as CCR5 and hROSA26.

Modification of cellular genomes can be undertaken with known approaches, such as Cre/Lox, Flp/Frt, transcription activator-like effector nuclease (TALEN), a TAL effector domain fusion protein, zinc finger nuclease (ZFN), a ZFN dimer, or an RNA-guided DNA endonuclease system, such as CRISPR/Cas9. See U.S. Pat. No. 9,816,110 at cols. 17-18. Integration using Bxb1 integrase in human, mouse and rat cells also can be undertaken. See Russell et al., Biotechniques 40: 460-64 (2006).

To maximize stability and efficiency and facilitate integration and control of the inventions, Stable Integration Sites (SIS) can be created using Genomic Safe Harbors and the like in a wide variety of cell types and lines according to the teachings of U.S. Serial No. 63/256,675. The descriptions (including examples) and figures providing methods and cells resulting from the methods of U.S. Serial No. 63/256,675 are hereby incorporated by reference.

For production of recombinant AAV, the inventions provide for integration of AAV Cap to produce size variants VP1 (about 90 kDa), VP2 (about 72 kDa) and VP3 (about 60 kDa). The variants differ at their N-termini.
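The relationship among the size variants can be illustrated with a short Python sketch. It assumes that each shorter variant corresponds to an in-frame 3′ portion of the longer coding sequence, so that the variants differ only by an N-terminal extension. The function names are hypothetical, and the sequences are short made-up placeholders, not entries from the Sequence Listing.

```python
def normalize(seq: str) -> str:
    """Remove whitespace and upper-case a nucleotide string."""
    return "".join(seq.split()).upper()


def n_terminal_extension(longer: str, shorter: str) -> int:
    """Return the number of extra 5' codons carried by `longer`.

    Returns -1 if `shorter` is not an in-frame 3'-terminal portion of
    `longer` (the assumption this sketch is built on).
    """
    longer, shorter = normalize(longer), normalize(shorter)
    extra = len(longer) - len(shorter)
    if extra < 0 or extra % 3 != 0 or not longer.endswith(shorter):
        return -1
    return extra // 3


# Toy placeholder sequences (hypothetical; NOT taken from the Sequence Listing).
TOY_VP3 = "ATGGCTACAGGC TAA"      # embedded space mimics line-wrapped listings
TOY_VP2 = "ACGGCTCCG" + TOY_VP3   # VP3 plus a 3-codon 5' extension
TOY_VP1 = "ATGGCTGCC" + TOY_VP2   # VP2 plus a further 3-codon 5' extension

print(n_terminal_extension(TOY_VP1, TOY_VP3))  # 6
print(n_terminal_extension(TOY_VP2, TOY_VP3))  # 3
```

Whitespace is stripped before comparison because sequence listings are often wrapped or interrupted by spaces.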

Usually, recombinant AAV will contain a gene of interest (GOI) flanked by AAV ITRs (inverted terminal repeats). For production of recombinant AAV, seven additional polynucleotides, namely adenovirus E1A, E1B, E4, E2A, VA RNA and AAV Rep and Cap, are typically employed.
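As a bookkeeping aid, the seven polynucleotides listed above can be treated as a checklist. The following Python sketch is illustrative only; the names `REQUIRED_COMPONENTS` and `missing_components` are hypothetical, and the sketch does not model where each component resides (integrated or on a plasmid).

```python
# The seven polynucleotides named in the paragraph above.
REQUIRED_COMPONENTS = ("E1A", "E1B", "E2A", "E4", "VA RNA", "Rep", "Cap")


def missing_components(provided):
    """Return the listed components that are absent from `provided`."""
    provided_set = set(provided)
    return [name for name in REQUIRED_COMPONENTS if name not in provided_set]


# Hypothetical cell in which only Cap and Rep have been supplied so far.
print(missing_components(["Cap", "Rep"]))
```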

FIGS. 1-4 depict constructs that permit constitutive expression of Cap protein. FIGS. 5-8 depict constructs that permit controlled expression of Cap protein by inclusion of an operator downstream of the promoter. A preferred operator is the tetracycline operator (TetO), which binds the tetracycline repressor (TetR). Tetracycline, doxycycline and derivatives thereof can bind TetR so that TetR no longer binds TetO, and thus is permissive for transcription. An example of a CMV promoter and TetO is set forth in Example 13.
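The control logic described above can be summarized as a simple boolean model. The Python sketch below is a deliberate simplification (real expression is graded rather than on/off), and the function and parameter names are hypothetical.

```python
def cap_transcription_permitted(has_tet_operator: bool,
                                tetr_present: bool,
                                inducer_present: bool) -> bool:
    """Simplified on/off model of Cap expression control.

    has_tet_operator: True for controllable constructs (FIGS. 5-8),
                      False for constitutive constructs (FIGS. 1-4).
    tetr_present:     True if the tetracycline repressor (TetR) is expressed.
    inducer_present:  True if tetracycline, doxycycline or a derivative
                      thereof is present.
    """
    if not has_tet_operator:
        # No operator for TetR to bind: expression is constitutive.
        return True
    # TetR occupies TetO only when it is present and not bound by inducer.
    tetr_bound_to_teto = tetr_present and not inducer_present
    return not tetr_bound_to_teto


# Controllable construct, repressor present, no inducer: repressed.
print(cap_transcription_permitted(True, True, False))   # False
# Same cell after adding doxycycline: permissive.
print(cap_transcription_permitted(True, True, True))    # True
```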

The inventions are further described by the following examples, which are illustrative of the many aspects of the inventions, but do not limit the inventions in any manner.

Example 1 - CHO Cells

One or more Cap-containing polynucleotides according to FIGS. 1-8 are stably inserted into the CHO genome. A preferred promoter is the hCMV-IE promoter, and optionally a tet operator can be operably linked to the promoter for expression control. Optionally, an intron can be located 3′ of the promoter. A preferred intron is an hCMV-IE intron. AAV Cap, Rep and ITRs can be obtained from any AAV serotype. Preferred AAV serotypes are AAV2 and AAV5. AAV polynucleotide sequences are set forth in Example 11. Promoter, operator, IRES and intron sequences are set forth in Example 13.

AAV ITRs flanking a gene of interest, AAV Rep and Ad E1A, E1B, E2A (or E2A partial sequence (E2A orf)), E4 (or E4 partial sequence (E4 orf 6)) and VA RNA can be randomly integrated, site-specifically integrated or remain on a plasmid. Adenovirus polynucleotide sequences are available and are exemplified in Example 12. Adenovirus (Ad) proteins and polynucleotides can be selected from any serotype.

Example 2 - HEK 293 Cells

One or more Cap-containing polynucleotides according to FIGS. 1-8 are stably inserted into the HEK 293 genome. A preferred promoter is the hCMV-IE promoter, and optionally a tet operator can be operably linked to the promoter for expression control. Optionally, an intron can be located 3′ of the promoter. A preferred intron is an hCMV-IE intron. AAV Cap, Rep and ITRs can be obtained from any AAV serotype. Preferred AAV serotypes are AAV2 and AAV5. AAV polynucleotide sequences are set forth in Example 11.

AAV ITRs and Rep and Ad E1A, E1B, E2A (or E2A partial sequence (E2A orf)), E4 (or E4 partial sequence (E4 orf 6)) and VA RNA can be randomly integrated, site-specifically integrated or remain on a plasmid. Adenovirus polynucleotide sequences are available and are exemplified in Example 12.

Example 3 - BHK Cells

BHK cells are fibroblast cells from baby hamster kidneys. There are adherent BHK lines and BHK lines that can propagate in suspension. See Wentz and Schügerl, Enzyme Microbial Tech. 14: 68-75 (1992).

One or more Cap-containing polynucleotides according to FIGS. 1-8 are stably inserted into the BHK genome. A preferred promoter is the hCMV-IE promoter, and optionally a tet operator can be operably linked to the promoter for expression control. Optionally, an intron can be located 3′ of the promoter. A preferred intron is an hCMV-IE intron. AAV Cap, Rep and ITRs can be obtained from any AAV serotype. Preferred AAV serotypes are AAV2 and AAV5. AAV polynucleotide sequences are set forth in Example 11.

AAV ITRs and Rep and Ad E1A, E1B, E2A (or E2A partial sequence (E2A orf)), E4 (or E4 partial sequence (E4 orf 6)) and VA RNA can be randomly integrated, site-specifically integrated or remain on a plasmid. Adenovirus polynucleotide sequences are available and are exemplified in Example 12.

Example 4 - Intron IRES Cap

Embodiments of this construct are depicted in FIGS. 1 (constitutive) and 5 (controllable). In a cell, both embodiments can primarily produce VP2 and VP3, with more VP3 being produced than VP2. In an experiment in CHO cells using Cap5, an average ratio of about 1:5.7 of VP2 to VP3 was observed by conducting densitometry analyses on western blots. Some VP1 also can be produced, but the level observed was typically less than 1% of the level of VP2 produced. Accordingly, this construct can be used with a construct that will produce primarily VP1. See Examples 6 and 7. Production ratios can vary based upon experimental conditions and analytical techniques. A preferred IRES is the encephalomyocarditis virus (referred to as “EMCV” or “ECMV”) IRES.

Example 5 - Intron IRES Cap IRES Cap

Embodiments of this construct are depicted in FIGS. 2 (constitutive) and 6 (controllable). This construct contains two IRES polynucleotides and two Cap polynucleotides. In a cell, both embodiments can produce VP1, VP2 and VP3. The amount of VP3 production can be greater than the amount of VP1 production and the amount of VP2 production. The amount of VP2 production can be greater than the amount of VP1 production. In an experiment using CHO cells containing Cap5, an average ratio of about 1:2:9.3 of VP1 to VP2 to VP3 was observed by conducting densitometry analyses on western blots. Production ratios can vary based upon experimental conditions and analytical techniques. A preferred IRES is the encephalomyocarditis virus (referred to as “EMCV” or “ECMV”) IRES.
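To make the reported ratios easier to compare, they can be converted into fractions of total detected capsid protein. The Python sketch below only normalizes the approximate ratios reported in Examples 4 and 5; the function name is hypothetical, and the arithmetic does not account for the small amount of VP1 noted in Example 4 or for densitometry error.

```python
def ratio_to_fractions(ratio):
    """Convert a ratio such as {'VP2': 1, 'VP3': 5.7} into fractions of the total."""
    total = sum(ratio.values())
    return {name: value / total for name, value in ratio.items()}


# Approximate ratios reported for Cap5 in CHO cells.
example_4 = ratio_to_fractions({"VP2": 1.0, "VP3": 5.7})
example_5 = ratio_to_fractions({"VP1": 1.0, "VP2": 2.0, "VP3": 9.3})

for label, fractions in (("Example 4", example_4), ("Example 5", example_5)):
    summary = ", ".join(f"{name} {value:.1%}" for name, value in fractions.items())
    print(f"{label}: {summary}")
# Example 4: VP2 14.9%, VP3 85.1%
# Example 5: VP1 8.1%, VP2 16.3%, VP3 75.6%
```

On these figures, VP3 accounts for roughly three quarters or more of the detected capsid protein in both constructs.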

Example 6 - Intron Cap

Embodiments of this construct are depicted in FIGS. 3 (constitutive) and 7 (controllable). This construct contains an Intron and a Cap polynucleotide, and produces predominantly VP1, and can be used with the constructs of Example 4 to produce VP1, VP2 and VP3.

Example 7 - IRES Cap

Embodiments of this construct are depicted in FIGS. 4 (constitutive) and 8 (controllable). This construct contains an IRES and a Cap polynucleotide, and produces predominantly VP1, and can be used with the constructs of Example 4 to produce VP1, VP2 and VP3. A preferred IRES is the encephalomyocarditis virus (referred to as “EMCV” or “ECMV”) IRES.

Example 8 - CHO Cells Comprising Cap Constructs

CHO cells of Example 1 can comprise an Intron IRES CAP IRES CAP polynucleotide of Example 5 (FIGS. 2 or 6). In an experiment, CHO cells containing Cap5 expressed VP1:VP2:VP3 in a ratio of about 1:2:9.3, as observed by conducting densitometry analyses on western blots.

As an alternative, CHO cells of Example 1 can comprise CAP polynucleotides of Example 4 (FIGS. 1 or 5) and Example 6 (FIGS. 3 or 7) or Example 7 (FIGS. 4 or 8). Such cells would express VP1, VP2 and VP3, although possibly at different ratios than according to Example 5.

Example 9 - HEK 293 Cells Comprising Cap Constructs

HEK 293 cells of Example 2 can comprise an Intron IRES CAP IRES CAP polynucleotide of Example 5 (FIGS. 2 or 6).

As an alternative, HEK 293 cells of Example 2 can comprise CAP polynucleotides of Example 4 (FIGS. 1 or 5) and Example 6 (FIGS. 3 or 7) or Example 7 (FIGS. 4 or 8). Such cells would express VP1, VP2 and VP3, although possibly at different ratios than according to Example 5.

Example 10 - BHK Cells Comprising Cap Constructs

BHK cells of Example 3 can comprise an Intron IRES CAP IRES CAP polynucleotide of Example 5 (FIGS. 2 or 6).

As an alternative, BHK cells of Example 3 can comprise CAP polynucleotides of Example 4 (FIGS. 1 or 5) and Example 6 (FIGS. 3 or 7) or Example 7 (FIGS. 4 or 8). Such cells would express VP1, VP2 and VP3, although possibly at different ratios than according to Example 5.
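The construct combinations described in Examples 8-10 can be checked with a small lookup model. The Python sketch below records only the predominant products stated in Examples 4-7 and ignores minor products and expression ratios; the dictionary and function names are hypothetical.

```python
# Predominant capsid products per construct, as stated in Examples 4-7.
PREDOMINANT_PRODUCTS = {
    "Intron IRES Cap": {"VP2", "VP3"},                  # Example 4
    "Intron IRES Cap IRES Cap": {"VP1", "VP2", "VP3"},  # Example 5
    "Intron Cap": {"VP1"},                              # Example 6
    "IRES Cap": {"VP1"},                                # Example 7
}

ALL_VARIANTS = {"VP1", "VP2", "VP3"}


def expressed_variants(constructs):
    """Union of the predominant products of the given constructs."""
    products = set()
    for construct in constructs:
        products |= PREDOMINANT_PRODUCTS[construct]
    return products


def covers_all_variants(constructs):
    """True if the combination is expected to yield VP1, VP2 and VP3."""
    return expressed_variants(constructs) == ALL_VARIANTS


print(covers_all_variants(["Intron IRES Cap IRES Cap"]))       # True
print(covers_all_variants(["Intron IRES Cap", "Intron Cap"]))  # True
print(covers_all_variants(["Intron IRES Cap"]))                # False
```

The last call reflects why the Example 4 construct is paired with a construct that produces primarily VP1.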

Example 11 - AAV Polynucleotide Sequences

AAV Rep, Cap and ITR sequences are known in the art. The present inventions are amenable to all AAV serotypes. AAV sequences from various AAV serotypes are set forth below. Many of these sequences are available from the National Center for Biotechnology Information (NCBI).

AAV-1 Full Genome: NC_002077

CapVP1: (SEQ ID NO: 1)

ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCGCGAGTGGTGGGACTTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTCGCCACAAGAGCCAGACTCCTCCTCGGGCATCGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCAGACTGGCGACTCAGAGTCAGTCCCCGATCCACAACCTCTCGGAGAACCTCCAGCAACCCCCGCTGCTGTGGGACCTACTACAATGGCTTCAGGCGGTGGCGCACCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCACCAGCACCCGCACCTGGGCCTTGCCCACCTACAATAACCACCTCTACAAGCAAATCTCCAGTGCTTCAACGGGGGCCAGCAACGACAACCACTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGATTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAGCGACTCATCAACAACAATTGGGGATTCCGGCCCAAGAGACTCAACTTCAAACTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACAACCATCGCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGCTTCCGTACGTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGATTCCGCAATACGGCTACCTGACGCTCAACAATGGCAGCCAAGCCGTGGGACGTTCATCCTTTTACTGCCTGGAATATTTCCCTTCTCAGATGCTGAGAACGGGCAACAACTTTACCTTCAGCTACACCTTTGAGGAAGTGCCTTTCCACAGCAGCTACGCGCACAGCCAGAGCCTGGACCGGCTGATGAATCCTCTCATCGACCAATACCTGTATTACCTGAACAGAACTCAAAATCAGTCCGGAAGTGCCCAAAACAAGGACTTGCTGTTTAGCCGTGGGTCTCCAGCTGGCATGTCTGTTCAGCCCAAAAACTGGCTACCTGGACCCTGTTATCGGCAGCAGCGCGTTTCTAAAACAAAAACAGACAACAACAACAGCAATTTTACCTGGACTGGTGCTTCAAAATATAACCTCAATGGGCGTGAATCCATCATCAACCCTGGCACTGCTATGGCCTCACACAAAGACGACGAAGACAAGTTCTTTCCCATGAGCGGTGTCATGATTTTTGGAAAAGAGAGCGCCGGAGCTTCAAACACTGCATTGGACAATGTCATGATTACAGACGAAGAGGAAATTAAAGCCACTAACCCTGGGCCACCGAAAGATTTGGGACCGTGGCAGTCAATTTCCAGAGCAGCAGCACAGACCCTGCGACCGGAGATGTGCATGCTATGGGAGCATTACCTGGCATGGTGTGGCAAGATAGAGACGTGTACCTGCAGGGTCCCATTTGGGCCAAAATTCCTCACACAGATGGACACTTTCACCCGTCTCCTCTTATGGGCGGCTTTGGACTCAAGAACCCGCCTCCTCAGATCCTCATCAAAAACACGCCTGTTCCTGCGAATCCTCCGGCGGAGTTTTCAGCTACAAAGTTT
GCTTCATTCATCACCCAATACTCCACAGGACAAGTGAGTGTGGAAATTGAATGGGAGCTGCAGAAAGAAAACAGCAAGCGCTGGAATCCCGAAGTGCAGTACACATCCAATTATGCAAAATCTGCCAACGTTGATTTTACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACCCGTTACCTTACCCG TCCCCTGTAA

Rep78: (SEQ ID NO: 2)

ATGCCGGGCTTCTACGAGATCGTGATCAAGGTGCCGAGCGACCTGGACGAGCACCTGCCGGGCATTTCTGACTCGTTTGTGAGCTGGGTGGCCGAGAAGGAATGGGAGCTGCCCCCGGATTCTGACATGGATCTGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTCCTGGTCCAATGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTTCAGTTCGAGAAGGGCGAGTCCTACTTCCACCTCCATATTCTGGTGGAGACCACGGGGGTCAAATCCATGGTGCTGGGCCGCTTCCTGAGTCAGATTAGGGACAAGCTGGTGCAGACCATCTACCGCGGGATCGAGCCGACCCTGCCCAACTGGTTCGCGGTGACCAAGACGCGTAATGGCGCCGGAGGGGGGAACAAGGTGGTGGACGAGTGCTACATCCCCAACTACCTCCTGCCCAAGACTCAGCCCGAGCTGCAGTGGGCGTGGACTAACATGGAGGAGTATATAAGCGCCTGTTTGAACCTGGCCGAGCGCAAACGGCTCGTGGCGCAGCACCTGACCCACGTCAGCCAGACCCAGGAGCAGAACAAGGAGAATCTGAACCCCAATTCTGACGCGCCTGTCATCCGGTCAAAAACCTCCGCGCGCTACATGGAGCTGGTCGGGTGGCTGGTGGACCGGGGCATCACCTCCGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCCGCTTCCAACTCGCGGTCCCAGATCAAGGCCGCTCTGGACAATGCCGGCAAGATCATGGCGCTGACCAAATCCGCGCCCGACTACCTGGTAGGCCCCGCTCCGCCCGCGGACATTAAAACCAACCGCATCTACCGCATCCTGGAGCTGAACGGCTACGAACCTGCCTACGCCGGCTCCGTCTTTCTCGGCTGGGCCCAGAAAAGGTTCGGGAAGCGCAACACCATCTGGCTGTTTGGGCCGGCCACCACGGGCAAGACCAACATCGCGGAAGCCATCGCCCACGCCGTGCCCTTCTACGGCTGCGTCAACTGGACCAATGAGAACTTTCCCTTCAATGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTGGAGTCCGCCAAGGCCATTCTCGGCGGCAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCGTCCGCCCAGATCGACCCCACCCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCGTTGCAGGACCGGATGTTCAAATTTGAACTCACCCGCCGTCTGGAGCATGACTTTGGCAAGGTGACAAAGCAGGAAGTCAAAGAGTTCTTCCGCTGGGCGCAGGATCACGTGACCGAGGTGGCGCATGAGTTCTACGTCAGAAAGGGTGGAGCCAACAAAAGACCCGCCCCCGATGACGCGGATAAAAGCGAGCCCAAGCGGGCCTGCCCCTCAGTCGCGGATCCATCGACGTCAGACGCGGAAGGAGCTCCGGTGGACTTTGCCGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGCATGCTTCAGATGCTGTTTCCCTGCAAGACATGCGAGAGAATGAATCAGAATTTCAACATTTGCTTCACGCACGGGACGAGAGACTGTTCAGAGTGCTTCCCCGGCGTGTCAGAATCTCAACCGGTCGTCAGAAAGAGGACGTATCGGAAACTCTGTGCCATTCATCATCTGCTGGGGCGGGCTCCCGAGATTGCTTGCTCGGCCTGCGATCTGGTCAACGTGGACCTGGATGACTGTGTTTCTGAGCAATAA

AAV-2 Full Genome: NC_001401

Rep78: (SEQ ID NO: 3)

ATGCCGGGGTTTTACGAGATTGTGATTAAGGTCCCCAGCGACCTTGACGAGCATCTGCCCGGCATTTCTGACAGCTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGTTGCCGCCAGATTCTGACATGGATCTGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTTCTGACGGAATGGCGCCGTGTGAGTAAGGCCCCGGAGGCCCTTTTCTTTGTGCAATTTGAGAAGGGAGAGAGCTACTTCCACATGCACGTGCTCGTGGAAACCACCGGGGTGAAATCCATGGTTTTGGGACGTTTCCTGAGTCAGATTCGCGAAAAACTGATTCAGAGAATTTACCGCGGGATCGAGCCGACTTTGCCAAACTGGTTCGCGGTCACAAAGACCAGAAATGGCGCCGGAGGCGGGAACAAGGTGGTGGATGAGTGCTACATCCCCAATTACTTGCTCCCCAAAACCCAGCCTGAGCTCCAGTGGGCGTGGACTAATATGGAACAGTATTTAAGCGCCTGTTTGAATCTCACGGAGCGTAAACGGTTGGTGGCGCAGCATCTGACGCACGTGTCGCAGACGCAGGAGCAGAACAAAGAGAATCAGAATCCCAATTCTGATGCGCCGGTGATCAGATCAAAAACTTCAGCCAGGTACATGGAGCTGGTCGGGTGGCTCGTGGACAAGGGGATTACCTCGGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCATACATCTCCTTCAATGCGGCCTCCAACTCGCGGTCCCAAATCAAGGCTGCCTTGGACAATGCGGGAAAGATTATGAGCCTGACTAAAACCGCCCCCGACTACCTGGTGGGCCAGCAGCCCGTGGAGGACATTTCCAGCAATCGGATTTATAAAATTTTGGAACTAAACGGGTACGATCCCCAATATGCGGCTTCCGTCTTTCTGGGATGGGCCACGAAAAAGTTCGGCAAGAGGAACACCATCTGGCTGTTTGGGCCTGCAACTACCGGGAAGACCAACATCGCGGAGGCCATAGCCCACACTGTGCCCTTCTACGGGTGCGTAAACTGGACCAATGAGAACTTTCCCTTCAACGACTGTGTCGACAAGATGGTGATCTGGTGGGAGGAGGGGAAGATGACCGCCAAGGTCGTGGAGTCGGCCAAAGCCATTCTCGGAGGAAGCAAGGTGCGCGTGGACCAGAAATGCAAGTCCTCGGCCCAGATAGACCCGACTCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACTCAACGACCTTCGAACACCAGCAGCCGTTGCAAGACCGGATGTTCAAATTTGAACTCACCCGCCGTCTGGATCATGACTTTGGGAAGGTCACCAAGCAGGAAGTCAAAGACTTTTTCCGGTGGGCAAAGGATCACGTGGTTGAGGTGGAGCATGAATTCTACGTCAAAAAGGGTGGAGCCAAGAAAAGACCCGCCCCCAGTGACGCAGATATAAGTGAGCCCAAACGGGTGCGCGAGTCAGTTGCGCAGCCATCGACGTCAGACGCGGAAGCTTCGATCAACTACGCAGACAGGTACCAAAACAAATGTTCTCGTCACGTGGGCATGAATCTGATGCTGTTTCCCTGCAGACAATGCGAGAGAATGAATCAGAATTCAAATATCTGCTTCACTCACGGACAGAAAGACTGTTTAGAGTGCTTTCCCGTGTCAGAATCTCAACCCGTTTCTGTCGTCAAAAAGGCGTATCAGAAACTGTGCTACATTCATCATATCATGGGAAAGGTGCCAGACGCTTGCACTGCCTGCGATCTGGTCAATGTGGATTTGGATGACTG CATCTTTGAACAATAA

Rep52: (SEQ ID NO: 4)

ATGGAGCTGGTCGGGTGGCTCGTGGACAAGGGGATTACCTCGGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCATACATCTCCTTCAATGCGGCCTCCAACTCGCGGTCCCAAATCAAGGCTGCCTTGGACAATGCGGGAAAGATTATGAGCCTGACTAAAACCGCCCCCGACTACCTGGTGGGCCAGCAGCCCGTGGAGGACATTTCCAGCAATCGGATTTATAAAATTTTGGAACTAAACGGGTACGATCCCCAATATGCGGCTTCCGTCTTTCTGGGATGGGCCACGAAAAAGTTCGGCAAGAGGAACACCATCTGGCTGTTTGGGCCTGCAACTACCGGGAAGACCAACATCGCGGAGGCCATAGCCCACACTGTGCCCTTCTACGGGTGCGTAAACTGGACCAATGAGAACTTTCCCTTCAACGACTGTGTCGACAAGATGGTGATCTGGTGGGAGGAGGGGAAGATGACCGCCAAGGTCGTGGAGTCGGCCAAAGCCATTCTCGGAGGAAGCAAGGTGCGCGTGGACCAGAAATGCAAGTCCTCGGCCCAGATAGACCCGACTCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACTCAACGACCTTCGAACACCAGCAGCCGTTGCAAGACCGGATGTTCAAATTTGAACTCACCCGCCGTCTGGATCATGACTTTGGGAAGGTCACCAAGCAGGAAGTCAAAGACTTTTTCCGGTGGGCAAAGGATCACGTGGTTGAGGTGGAGCATGAATTCTACGTCAAAAAGGGTGGAGCCAAGAAAAGACCCGCCCCCAGTGACGCAGATATAAGTGAGCCCAAACGGGTGCGCGAGTCAGTTGCGCAGCCATCGACGTCAGACGCGGAAGCTTCGATCAACTACGCAGACAGGTACCAAAACAAATGTTCTCGTCACGTGGGCATGAATCTGATGCTGTTTCCCTGCAGACAATGCGAGAGAATGAATCAGAATTCAAATATCTGCTTCACTCACGGACAGAAAGACTGTTTAGAGTGCTTTCCCGTGTCAGAATCTCAACCCGTTTCTGTCGTCAAAAAGGCGTATCAGAAACTGTGCTACATTCATCATATCATGGGAAAGGTGCCAGACGCTTGCACTGCCTGCGATCTGGTCAATGTGGATTTGGATGACTGCATCTTTGAACAATAA

CapVP1: (SEQ ID NO: 5)

ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGAATAAGACAGTGGTGGAAGCTCAAACCTGGCCCACCACCACCAAAGCCCGCAGAGCGGCATAAGGACGACAGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAAGCCTACGACCGGCAGCTCGACAGCGGAGACAACCCGTACCTCAAGTACAACCACGCCGACGCGGAGTTTCAGGAGCGCCTTAAAGAAGATACGTCTTTTGGGGGCAACCTCGGACGAGCAGTCTTCCAGGCGAAAAAGAGGGTTCTTGAACCTCTGGGCCTGGTTGAGGAACCTGTTAAGACGGCTCCGGGAAAAAAGAGGCCGGTAGAGCACTCTCCTGTGGAGCCAGACTCCTCCTCGGGAACCGGAAAGGCGGGCCAGCAGCCTGCAAGAAAAAGATTGAATTTTGGTCAGACTGGAGACGCAGACTCAGTACCTGACCCCCAGCCTCTCGGACAGCCACCAGCAGCCCCCTCTGGTCTGGGAACTAATACGATGGCTACAGGCAGTGGCGCACCAATGGCAGACAATAACGAGGGCGCCGACGGAGTGGGTAATTCCTCGGGAAATTGGCATTGCGATTCCACATGGATGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAACAAATTTCCAGCCAATCAGGAGCCTCGAACGACAATCACTACTTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGACCCAAGAGACTCAACTTCAAGCTCTTTAACATTCAAGTCAAAGAGGTCACGCAGAATGACGGTACGACGACGATTGCCAATAACCTTACCAGCACGGTTCAGGTGTTTACTGACTCGGAGTACCAGCTCCCGTACGTCCTCGGCTCGGCGCATCAAGGATGCCTCCCGCCGTTCCCAGCAGACGTCTTCATGGTGCCACAGTATGGATACCTCACCCTGAACAACGGGAGTCAGGCAGTAGGACGCTCTTCATTTTACTGCCTGGAGTACTTTCCTTCTCAGATGCTGCGTACCGGAAACAACTTTACCTTCAGCTACACTTTTGAGGACGTTCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTCTGGACCGTCTCATGAATCCTCTCATCGACCAGTACCTGTATTACTTGAGCAGAACAAACACTCCAAGTGGAACCACCACGCAGTCAAGGCTTCAGTTTTCTCAGGCCGGAGCGAGTGACATTCGGGACCAGTCTAGGAACTGGCTTCCTGGACCCTGTTACCGCCAGCAGCGAGTATCAAAGACATCTGCGGATAACAACAACAGTGAATACTCGTGGACTGGAGCTACCAAGTACCACCTCAATGGCAGAGACTCTCTGGTGAATCCGGGCCCGGCCATGGCAAGCCACAAGGACGATGAAGAAAAGTTTTTTCCTCAGAGCGGGGTTCTCATCTTTGGGAAGCAAGGCTCAGAGAAAACAAATGTGGACATTGAAAAGGTCATGATTACAGACGAAGAGGAAATCAGGACAACCAATCCCGTGGCTACGGAGCAGTATGGTTCTGTATCTACCAACCTCCAGAGAGGCAACAGACAAGCAGCTACCGCAGATGTCAACACACAAGGCGTTCTTCCAGGCATGGTCTGGCAGGACAGAGATGTGTACCTTCAGGGGCCCATCTGGGCAAAGATTCCACACACGGACGGACATTTTCACCCCTCTCCCCTCATGGGTGGATTCGGACTTAAACACCCTCCTCCACAGATTCTCATCAAGAACACCCCGGTACCTGCGAATCCTTCGACCACCTTCAGTGCGGCAAAGTTTGC
TTCCTTCATCACACAGTACTCCACGGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGGAAAACAGCAAACGCTGGAATCCCGAAATTCAGTACACTTCCAACTACAACAAGTCTGTTAATGTGGACTTTACTGTGGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACCAGATACCTGACTCGTA ATCTGTAA

CapVP2: (SEQ ID NO: 6)

ACGGCTCCGGGAAAAAAGAGGCCGGTAGAGCACTCTCCTGTGGAGCCAGACTCCTCCTCGGGAACCGGAAAGGCGGGCCAGCAGCCTGCAAGAAAAAGATTGAATTTTGGTCAGACTGGAGACGCAGACTCAGTACCTGACCCCCAGCCTCTCGGACAGCCACCAGCAGCCCCCTCTGGTCTGGGAACTAATACGATGGCTACAGGCAGTGGCGCACCAATGGCAGACAATAACGAGGGCGCCGACGGAGTGGGTAATTCCTCGGGAAATTGGCATTGCGATTCCACATGGATGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAACAAATTTCCAGCCAATCAGGAGCCTCGAACGACAATCACTACTTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGACCCAAGAGACTCAACTTCAAGCTCTTTAACATTCAAGTCAAAGAGGTCACGCAGAATGACGGTACGACGACGATTGCCAATAACCTTACCAGCACGGTTCAGGTGTTTACTGACTCGGAGTACCAGCTCCCGTACGTCCTCGGCTCGGCGCATCAAGGATGCCTCCCGCCGTTCCCAGCAGACGTCTTCATGGTGCCACAGTATGGATACCTCACCCTGAACAACGGGAGTCAGGCAGTAGGACGCTCTTCATTTTACTGCCTGGAGTACTTTCCTTCTCAGATGCTGCGTACCGGAAACAACTTTACCTTCAGCTACACTTTTGAGGACGTTCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTCTGGACCGTCTCATGAATCCTCTCATCGACCAGTACCTGTATTACTTGAGCAGAACAAACACTCCAAGTGGAACCACCACGCAGTCAAGGCTTCAGTTTTCTCAGGCCGGAGCGAGTGACATTCGGGACCAGTCTAGGAACTGGCTTCCTGGACCCTGTTACCGCCAGCAGCGAGTATCAAAGACATCTGCGGATAACAACAACAGTGAATACTCGTGGACTGGAGCTACCAAGTACCACCTCAATGGCAGAGACTCTCTGGTGAATCCGGGCCCGGCCATGGCAAGCCACAAGGACGATGAAGAAAAGTTTTTTCCTCAGAGCGGGGTTCTCATCTTTGGGAAGCAAGGCTCAGAGAAAACAAATGTGGACATTGAAAAGGTCATGATTACAGACGAAGAGGAAATCAGGACAACCAATCCCGTGGCTACGGAGCAGTATGGTTCTGTATCTACCAACCTCCAGAGAGGCAACAGACAAGCAGCTACCGCAGATGTCAACACACAAGGCGTTCTTCCAGGCATGGTCTGGCAGGACAGAGATGTGTACCTTCAGGGGCCCATCTGGGCAAAGATTCCACACACGGACGGACATTTTCACCCCTCTCCCCTCATGGGTGGATTCGGACTTAAACACCCTCCTCCACAGATTCTCATCAAGAACACCCCGGTACCTGCGAATCCTTCGACCACCTTCAGTGCGGCAAAGTTTGCTTCCTTCATCACACAGTACTCCACGGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGGAAAACAGCAAACGCTGGAATCCCGAAATTCAGTACACTTCCAACTACAACAAGTCTGTTAATGTGGACTTTACTGTGGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACCAGATACCTGACTCGTAATCTGTAA

CapVP3: (SEQ ID NO: 7)

ATGGCTACAGGCAGTGGCGCACCAATGGCAGACAATAACGAGGGCGCCGACGGAGTGGGTAATTCCTCGGGAAATTGGCATTGCGATTCCACATGGATGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAACAAATTTCCAGCCAATCAGGAGCCTCGAACGACAATCACTACTTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGACCCAAGAGACTCAACTTCAAGCTCTTTAACATTCAAGTCAAAGAGGTCACGCAGAATGACGGTACGACGACGATTGCCAATAACCTTACCAGCACGGTTCAGGTGTTTACTGACTCGGAGTACCAGCTCCCGTACGTCCTCGGCTCGGCGCATCAAGGATGCCTCCCGCCGTTCCCAGCAGACGTCTTCATGGTGCCACAGTATGGATACCTCACCCTGAACAACGGGAGTCAGGCAGTAGGACGCTCTTCATTTTACTGCCTGGAGTACTTTCCTTCTCAGATGCTGCGTACCGGAAACAACTTTACCTTCAGCTACACTTTTGAGGACGTTCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTCTGGACCGTCTCATGAATCCTCTCATCGACCAGTACCTGTATTACTTGAGCAGAACAAACACTCCAAGTGGAACCACCACGCAGTCAAGGCTTCAGTTTTCTCAGGCCGGAGCGAGTGACATTCGGGACCAGTCTAGGAACTGGCTTCCTGGACCCTGTTACCGCCAGCAGCGAGTATCAAAGACATCTGCGGATAACAACAACAGTGAATACTCGTGGACTGGAGCTACCAAGTACCACCTCAATGGCAGAGACTCTCTGGTGAATCCGGGCCCGGCCATGGCAAGCCACAAGGACGATGAAGAAAAGTTTTTTCCTCAGAGCGGGGTTCTCATCTTTGGGAAGCAAGGCTCAGAGAAAACAAATGTGGACATTGAAAAGGTCATGATTACAGACGAAGAGGAAATCAGGACAACCAATCCCGTGGCTACGGAGCAGTATGGTTCTGTATCTACCAACCTCCAGAGAGGCAACAGACAAGCAGCTACCGCAGATGTCAACACACAAGGCGTTCTTCCAGGCATGGTCTGGCAGGACAGAGATGTGTACCTTCAGGGGCCCATCTGGGCAAAGATTCCACACACGGACGGACATTTTCACCCCTCTCCCCTCATGGGTGGATTCGGACTTAAACACCCTCCTCCACAGATTCTCATCAAGAACACCCCGGTACCTGCGAATCCTTCGACCACCTTCAGTGCGGCAAAGTTTGCTTCCTTCATCACACAGTACTCCACGGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGGAAAACAGCAAACGCTGGAATCCCGAAATTCAGTACACTTCCAACTACAACAAGTCTGTTAATGTGGACTTTACTGTGGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACCAGATACCTGACTCGTAATCTGT AA

CapAAP: (SEQ ID NO: 8)

CTGGAGACGCAGACTCAGTACCTGACCCCCAGCCTCTCGGACAGCCACCAGCAGCCCCCTCTGGTCTGGGAACTAATACGATGGCTACAGGCAGTGGCGCACCAATGGCAGACAATAACGAGGGCGCCGACGGAGTGGGTAATTCCTCGGGAAATTGGCATTGCGATTCCACATGGATGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAACAAATTTCCAGCCAATCAGGAGCCTCGAACGACAATCACTACTTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGACCCAAGAGACTCAACTTCAAGCTCTTTAACATTCAAGTCAAAGAGGTCACGCAGAATGACGGTACGACGACGATTGCCAATAACCTTACCAGCACGGTTCAGGTGTTTACTGACTCGGAGTACCAGCTCCCGTACGTCCTCGGCTCGGCGCATCAAGGATGCCTCCCGCCGTTCCCAGCAGACGTCTTCATGGTGCCACAGTATG GATACCTCACCCTGA

AAV-3 Full Genome: NC_001729

Rep78: (SEQ ID NO: 9)

ATGCCGGGGTTCTACGAGATTGTCCTGAAGGTCCCGAGTGACCTGGACGAGCGCCTGCCGGGCATTTCTAACTCGTTTGTTAACTGGGTGGCCGAGAAGGAATGGGACGTGCCGCCGGATTCTGACATGGATCCGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAAAAGCTTCAGCGCGAGTTCCTGGTGGAGTGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTTTTTGTCCAGTTCGAAAAGGGGGAGACCTACTTCCACCTGCACGTGCTGATTGAGACCATCGGGGTCAAATCCATGGTGGTCGGCCGCTACGTGAGCCAGATTAAAGAGAAGCTGGTGACCCGCATCTACCGCGGGGTCGAGCCGCAGCTTCCGAACTGGTTCGCGGTGACCAAAACGCGAAATGGCGCCGGGGGCGGGAACAAGGTGGTGGACGACTGCTACATCCCCAACTACCTGCTCCCCAAGACCCAGCCCGAGCTCCAGTGGGCGTGGACTAACATGGACCAGTATTTAAGCGCCTGTTTGAATCTCGCGGAGCGTAAACGGCTGGTGGCGCAGCATCTGACGCACGTGTCGCAGACGCAGGAGCAGAACAAAGAGAATCAGAACCCCAATTCTGACGCGCCGGTCATCAGGTCAAAAACCTCAGCCAGGTACATGGAGCTGGTCGGGTGGCTGGTGGACCGCGGGATCACGTCAGAAAAGCAATGGATTCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCCCAGATCAAGGCCGCGCTGGACAATGCCTCCAAGATCATGAGCCTGACAAAGACGGCTCCGGACTACCTGGTGGGCAGCAACCCGCCGGAGGACATTACCAAAAATCGGATCTACCAAATCCTGGAGCTGAACGGGTACGATCCGCAGTACGCGGCCTCCGTCTTCCTGGGCTGGGCGCAAAAGAAGTTCGGGAAGAGGAACACCATCTGGCTCTTTGGGCCGGCCACGACGGGTAAAACCAACATCGCGGAAGCCATCGCCCACGCCGTGCCCTTCTACGGCTGCGTAAACTGGACCAATGAGAACTTTCCCTTCAACGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTGGAGAGCGCCAAGGCCATTCTGGGCGGAAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCATCGGCCCAGATCGAACCCACTCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACAGCACCACCTTCGAGCATCAGCAGCCGCTGCAGGACCGGATGTTTGAATTTGAACTTACCCGCCGTTTGGACCATGACTTTGGGAAGGTCACCAAACAGGAAGTAAAGGACTTTTTCCGGTGGGCTTCCGATCACGTGACTGACGTGGCTCATGAGTTCTACGTCAGAAAGGGTGGAGCTAAGAAACGCCCCGCCTCCAATGACGCGGATGTAAGCGAGCCAAAACGGGAGTGCACGTCACTTGCGCAGCCGACAACGTCAGACGCGGAAGCACCGGCGGACTACGCGGACAGGTACCAAAACAAATGTTCTCGTCACGTGGGCATGAATCTGATGCTTTTTCCCTGTAAAACATGCGAGAGAATGAATCAAATTTCCAATGTCTGTTTTACGCATGGTCAAAGAGACTGTGGGGAATGCTTCCCTGGAATGTCAGAATCTCAACCCGTTTCTGTCGTCAAAAAGAAGACTTATCAGAAACTGTGTCCAATTCATCATATCCTGGGAAGGGCACCCGAGATTGCCTGTTCGGCCTGCGATTTGGCCAATGTGGACTTGGATGACTGTGTTTCTGAGCAATAA

CapVP1: (SEQ ID NO:10)

ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTTTCTGAAGGCATTCGTGAGTGGTGGGCTCTGAAACCTGGAGTCCCTCAACCCAAAGCGAACCAACAACACCAGGACAACCGTCGGGGTCTTGTGCTTCCGGGTTACAAATACCTCGGACCCGGTAACGGACTCGACAAAGGAGAGCCGGTCAACGAGGCGGACGCGGCAGCCCTCGAACACGACAAAGCTTACGACCAGCAGCTCAAGGCCGGTGACAACCCGTACCTCAAGTACAACCACGCCGACGCCGAGTTTCAGGAGCGTCTTCAAGAAGATACGTCTTTTGGGGGCAACCTTGGCAGAGCAGTCTTCCAGGCCAAAAAGAGGATCCTTGAGCCTCTTGGTCTGGTTGAGGAAGCAGCTAAAACGGCTCCTGGAAAGAAGGGGGCTGTAGATCAGTCTCCTCAGGAACCGGACTCATCATCTGGTGTTGGCAAATCGGGCAAACAGCCTGCCAGAAAAAGACTAAATTTCGGTCAGACTGGAGACTCAGAGTCAGTCCCAGACCCTCAACCTCTCGGAGAACCACCAGCAGCCCCCACAAGTTTGGGATCTAATACAATGGCTTCAGGCGGTGGCGCACCAATGGCAGACAATAACGAGGGTGCCGATGGAGTGGGTAATTCCTCAGGAAATTGGCATTGCGATTCCCAATGGCTGGGCGACAGAGTCATCACCACCAGCACCAGAACCTGGGCCCTGCCCACTTACAACAACCATCTCTACAAGCAAATCTCCAGCCAATCAGGAGCTTCAAACGACAACCACTACTTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTCTCACCACGTGACTGGCAGCGACTCATTAACAACAACTGGGGATTCCGGCCCAAGAAACTCAGCTTCAAGCTCTTCAACATCCAAGTTAGAGGGGTCACGCAGAACGATGGCACGACGACTATTGCCAATAACCTTACCAGCACGGTTCAAGTGTTTACGGACTCGGAGTATCAGCTCCCGTACGTGCTCGGGTCGGCGCACCAAGGCTGTCTCCCGCCGTTTCCAGCGGACGTCTTCATGGTCCCTCAGTATGGATACCTCACCCTGAACAACGGAAGTCAAGCGGTGGGACGCTCATCCTTTTACTGCCTGGAGTACTTCCCTTCGCAGATGCTAAGGACTGGAAATAACTTCCAATTCAGCTATACCTTCGAGGATGTACCTTTTCACAGCAGCTACGCTCACAGCCAGAGTTTGGATCGCTTGATGAATCCTCTTATTGATCAGTATCTGTACTACCTGAACAGAACGCAAGGAACAACCTCTGGAACAACCAACCAATCACGGCTGCTTTTTAGCCAGGCTGGGCCTCAGTCTATGTCTTTGCAGGCCAGAAATTGGCTACCTGGGCCCTGCTACCGGCAACAGAGACTTTCAAAGACTGCTAACGACAACAACAACAGTAACTTTCCTTGGACAGCGGCCAGCAAATATCATCTCAATGGCCGCGACTCGCTGGTGAATCCAGGACCAGCTATGGCCAGTCACAAGGACGATGAAGAAAAATTTTTCCCTATGCACGGCAATCTAATATTTGGCAAAGAAGGGACAACGGCAAGTAACGCAGAATTAGATAATGTAATGATTACGGATGAAGAAGAGATTCGTACCACCAATCCTGTGGCAACAGAGCAGTATGGAACTGTGGCAAATAACTTGCAGAGCTCAAATACAGCTCCCACGACTGGAACTGTCAATCATCAGGGGGCCTTACCTGGCATGGTGTGGCAAGATCGTGACGTGTACCTTCAAGGACCTATCTGGGCAAAGATTCCTCACACGGATGGACACTTTCATCCTTCTCCTCTGATGGGAGGCTTTGGACTGAAACATCCGCCTCCTCAAATCATGATCAAAAATACTCCGGTACCGGCAAATCCTCCGACGACTTTCAGCCCGGCCAAGTT
TGCTTCATTTATCACTCAGTACTCCACTGGACAGGTCAGCGTGGAAATTGAGTGGGAGCTACAGAAAGAAAACAGCAAACGTTGGAATCCAGAGATTCAGTACACTTCCAACTACAACAAGTCTGTTAATGTGGACTTTACTGTAGACACTAATGGTGTTTATAGTGAACCTCGCCCTATTGGAACCCGGTATCTCACACGAAACTTGTGA

AAV-4 Full Genome: NC_001829

Rep78: (SEQ ID NO:11)

ATGCCGGGGTTCTACGAGATCGTGCTGAAGGTGCCCAGCGACCTGGACGAGCACCTGCCCGGCATTTCTGACTCTTTTGTGAGCTGGGTGGCCGAGAAGGAATGGGAGCTGCCGCCGGATTCTGACATGGACTTGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAAAAGCTGCAACGCGAGTTCCTGGTCGAGTGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTCCAGTTCGAGAAGGGGGACAGCTACTTCCACCTGCACATCCTGGTGGAGACCGTGGGCGTCAAATCCATGGTGGTGGGCCGCTACGTGAGCCAGATTAAAGAGAAGCTGGTGACCCGCATCTACCGCGGGGTCGAGCCGCAGCTTCCGAACTGGTTCGCGGTGACCAAGACGCGTAATGGCGCCGGAGGCGGGAACAAGGTGGTGGACGACTGCTACATCCCCAACTACCTGCTCCCCAAGACCCAGCCCGAGCTCCAGTGGGCGTGGACTAACATGGACCAGTATATAAGCGCCTGTTTGAATCTCGCGGAGCGTAAACGGCTGGTGGCGCAGCATCTGACGCACGTGTCGCAGACGCAGGAGCAGAACAAGGAAAACCAGAACCCCAATTCTGACGCGCCGGTCATCAGGTCAAAAACCTCCGCCAGGTACATGGAGCTGGTCGGGTGGCTGGTGGACCGCGGGATCACGTCAGAAAAGCAATGGATCCAGGAGGACCAGGCGTCCTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCACAAATCAAGGCCGCGCTGGACAATGCCTCCAAAATCATGAGCCTGACAAAGACGGCTCCGGACTACCTGGTGGGCCAGAACCCGCCGGAGGACATTTCCAGCAACCGCATCTACCGAATCCTCGAGATGAACGGGTACGATCCGCAGTACGCGGCCTCCGTCTTCCTGGGCTGGGCGCAAAAGAAGTTCGGGAAGAGGAACACCATCTGGCTCTTTGGGCCGGCCACGACGGGTAAAACCAACATCGCGGAAGCCATCGCCCACGCCGTGCCCTTCTACGGCTGCGTGAACTGGACCAATGAGAACTTTCCGTTCAACGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTAGAGAGCGCCAAGGCCATCCTGGGCGGAAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCATCGGCCCAGATCGACCCAACTCCCGTGATCGTCACCTCCAACACCAACATGTGCGCGGTCATCGACGGAAACTCGACCACCTTCGAGCACCAACAACCACTCCAGGACCGGATGTTCAAGTTCGAGCTCACCAAGCGCCTGGAGCACGACTTTGGCAAGGTCACCAAGCAGGAAGTCAAAGACTTTTTCCGGTGGGCGTCAGATCACGTGACCGAGGTGACTCACGAGTTTTACGTCAGAAAGGGTGGAGCTAGAAAGAGGCCCGCCCCCAATGACGCAGATATAAGTGAGCCCAAGCGGGCCTGTCCGTCAGTTGCGCAGCCATCGACGTCAGACGCGGAAGCTCCGGTGGACTACGCGGACAGGTACCAAAACAAATGTTCTCGTCACGTGGGTATGAATCTGATGCTTTTTCCCTGCCGGCAATGCGAGAGAATGAATCAGAATGTGGACATTTGCTTCACGCACGGGGTCATGGACTGTGCCGAGTGCTTCCCCGTGTCAGAATCTCAACCCGTGTCTGTCGTCAGAAAGCGGACGTATCAGAAACTGTGTCCGATTCATCACATCATGGGGAGGGCGCCCGAGGTGGCCTGCTCGGCCTGCGAACTGGCCAATGTGGACTTGGATGACTGTGACATGGAACAATAA

CapVP1: (SEQ ID NO:12)

ATGACTGACGGTTACCTTCCAGATTGGCTAGAGGACAACCTCTCTGAAGGCGTTCGAGAGTGGTGGGCGCTGCAACCTGGAGCCCCTAAACCCAAGGCAAATCAACAACATCAGGACAACGCTCGGGGTCTTGTGCTTCCGGGTTACAAATACCTCGGACCCGGCAACGGACTCGACAAGGGGGAACCCGTCAACGCAGCGGACGCGGCAGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTCAAGGCCGGTGACAACCCCTACCTCAAGTACAACCACGCCGACGCGGAGTTCCAGCAGCGGCTTCAGGGCGACACATCGTTTGGGGGCAACCTCGGCAGAGCAGTCTTCCAGGCCAAAAAGAGGGTTCTTGAACCTCTTGGTCTGGTTGAGCAAGCGGGTGAGACGGCTCCTGGAAAGAAGAGACCGTTGATTGAATCCCCCCAGCAGCCCGACTCCTCCACGGGTATCGGCAAAAAAGGCAAGCAGCCGGCTAAAAAGAAGCTCGTTTTCGAAGACGAAACTGGAGCAGGCGACGGACCCCCTGAGGGATCAACTTCCGGAGCCATGTCTGATGACAGTGAGATGCGTGCAGCAGCTGGCGGAGCTGCAGTCGAGGGCGGACAAGGTGCCGATGGAGTGGGTAATGCCTCGGGTGATTGGCATTGCGATTCCACCTGGTCTGAGGGCCACGTCACGACCACCAGCACCAGAACCTGGGTCTTGCCCACCTACAACAACCACCTCTACAAGCGACTCGGAGAGAGCCTGCAGTCCAACACCTACAACGGATTCTCCACCCCCTGGGGATACTTTGACTTCAACCGCTTCCACTGCCACTTCTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGCATGCGACCCAAAGCCATGCGGGTCAAAATCTTCAACATCCAGGTCAAGGAGGTCACGACGTCGAACGGCGAGACAACGGTGGCTAATAACCTTACCAGCACGGTTCAGATCTTTGCGGACTCGTCGTACGAACTGCCGTACGTGATGGATGCGGGTCAAGAGGGCAGCCTGCCTCCTTTTCCCAACGACGTCTTTATGGTGCCCCAGTACGGCTACTGTGGACTGGTGACCGGCAACACTTCGCAGCAACAGACTGACAGAAATGCCTTCTACTGCCTGGAGTACTTTCCTTCGCAGATGCTGCGGACTGGCAACAACTTTGAAATTACGTACAGTTTTGAGAAGGTGCCTTTCCACTCGATGTACGCGCACAGCCAGAGCCTGGACCGGCTGATGAACCCTCTCATCGACCAGTACCTGTGGGGACTGCAATCGACCACCACCGGAACCACCCTGAATGCCGGGACTGCCACCACCAACTTTACCAAGCTGCGGCCTACCAACTTTTCCAACTTTAAAAAGAACTGGCTGCCCGGGCCTTCAATCAAGCAGCAGGGCTTCTCAAAGACTGCCAATCAAAACTACAAGATCCCTGCCACCGGGTCAGACAGTCTCATCAAATACGAGACGCACAGCACTCTGGACGGAAGATGGAGTGCCCTGACCCCCGGACCTCCAATGGCCACGGCTGGACCTGCGGACAGCAAGTTCAGCAACAGCCAGCTCATCTTTGCGGGGCCTAAACAGAACGGCAACACGGCCACCGTACCCGGGACTCTGATCTTCACCTCTGAGGAGGAGCTGGCAGCCACCAACGCCACCGATACGGACATGTGGGGCAACCTACCTGGCGGTGACCAGAGCAACAGCAACCTGCCGACCGTGGACAGACTGACAGCCTTGGGAGCCGTGCCTGGAATGGTCTGGCAAAACAGAGACATTTACTACCAGGGTCCCATTTGGGCCAAGATTCCTCATACCGATGGACACTTTCACCCCTCACCGCTGATTGGTGGGTTTGGGCTGAAACACCCGCCTCCTCAAATTTTTATCAAGAACACCCCGGTACCTGCGAATCCTGCAACGACCTTCAGCTCTACTCCGGTAAACTC
CTTCATTACTCAGTACAGCACTGGCCAGGTGTCGGTGCAGATTGACTGGGAGATCCAGAAGGAGCGGTCCAAACGCTGGAACCCCGAGGTCCAGTTTACCTCCAACTACGGACAGCAAAACTCTCTGTTGTGGGCTCCCGATGCGGCTGGGAAATACACTGAGCCTAGGGCTATCGGTACCCGCTACCTCACCCACCACCTGTAA

AAV-5 Full Genome: NC_006152

Rep78: (SEQ ID NO:13)

ATGGCTACCTTCTATGAAGTCATTGTTCGCGTCCCATTTGACGTGGAGGAACATCTGCCTGGAATTTCTGACAGCTTTGTGGACTGGGTAACTGGTCAAATTTGGGAGCTGCCTCCAGAGTCAGATTTAAATTTGACTCTGGTTGAACAGCCTCAGTTGACGGTGGCTGATAGAATTCGCCGCGTGTTCCTGTACGAGTGGAACAAATTTTCCAAGCAGGAGTCCAAATTCTTTGTGCAGTTTGAAAAGGGATCTGAATATTTTCATCTGCACACGCTTGTGGAGACCTCCGGCATCTCTTCCATGGTCCTCGGCCGCTACGTGAGTCAGATTCGCGCCCAGCTGGTGAAAGTGGTCTTCCAGGGAATTGAACCCCAGATCAACGACTGGGTCGCCATCACCAAGGTAAAGAAGGGCGGAGCCAATAAGGTGGTGGATTCTGGGTATATTCCCGCCTACCTGCTGCCGAAGGTCCAACCGGAGCTTCAGTGGGCGTGGACAAACCTGGACGAGTATAAATTGGCCGCCCTGAATCTGGAGGAGCGCAAACGGCTCGTCGCGCAGTTTCTGGCAGAATCCTCGCAGCGCTCGCAGGAGGCGGCTTCGCAGCGTGAGTTCTCGGCTGACCCGGTCATCAAAAGCAAGACTTCCCAGAAATACATGGCGCTCGTCAACTGGCTCGTGGAGCACGGCATCACTTCCGAGAAGCAGTGGATCCAGGAAAATCAGGAGAGCTACCTCTCCTTCAACTCCACCGGCAACTCTCGGAGCCAGATCAAGGCCGCGCTCGACAACGCGACCAAAATTATGAGTCTGACAAAAAGCGCGGTGGACTACCTCGTGGGGAGCTCCGTTCCCGAGGACATTTCAAAAAACAGAATCTGGCAAATTTTTGAGATGAATGGCTACGACCCGGCCTACGCGGGATCCATCCTCTACGGCTGGTGTCAGCGCTCCTTCAACAAGAGGAACACCGTCTGGCTCTACGGACCCGCCACGACCGGCAAGACCAACATCGCGGAGGCCATCGCCCACACTGTGCCCTTTTACGGCTGCGTGAACTGGACCAATGAAAACTTTCCCTTTAATGACTGTGTGGACAAAATGCTCATTTGGTGGGAGGAGGGAAAGATGACCAACAAGGTGGTTGAATCCGCCAAGGCCATCCTGGGGGGCTCAAAGGTGCGGGTCGATCAGAAATGTAAATCCTCTGTTCAAATTGATTCTACCCCTGTCATTGTAACTTCCAATACAAACATGTGTGTGGTGGTGGATGGGAATTCCACGACCTTTGAACACCAGCAGCCGCTGGAGGACCGCATGTTCAAATTTGAACTGACTAAGCGGCTCCCGCCAGATTTTGGCAAGATTACTAAGCAGGAAGTCAAGGACTTTTTTGCTTGGGCAAAGGTCAATCAGGTGCCGGTGACTCACGAGTTTAAAGTTCCCAGGGAATTGGCGGGAACTAAAGGGGCGGAGAAATCTCTAAAACGCCCACTGGGTGACGTCACCAATACTAGCTATAAAAGTCTGGAGAAGCGGGCCAGGCTCTCATTTGTTCCCGAGACGCCTCGCAGTTCAGACGTGACTGTTGATCCCGCTCCTCTGCGACCGCTCAATTGGAATTCAAGGTATGATTGCAAATGTGACTATCATGCTCAATTTGACAACATTTCTAACAAATGTGATGAATGTGAATATTTGAATCGGGGCAAAAATGGATGTATCTGTCACAATGTAACTCACTGTCAAATTTGTCATGGGATTCCCCCCTGGGAAAAGGAAAACTTGTCAGATTTTGGGGATTTTGACGATGCCAATAAAGAACAGTAA

CapVP1: (SEQ ID NO:14)

ATGTCTTTTGTTGATCACCCTCCAGATTGGTTGGAAGAAGTTGGTGAAGGTCTTCGCGAGTTTTTGGGCCTTGAAGCGGGCCCACCGAAACCAAAACCCAATCAGCAGCATCAAGATCAAGCCCGTGGTCTTGTGCTGCCTGGTTATAACTATCTCGGACCCGGAAACGGTCTCGATCGAGGAGAGCCTGTCAACAGGGCAGACGAGGTCGCGCGAGAGCACGACATCTCGTACAACGAGCAGCTTGAGGCGGGAGACAACCCCTACCTCAAGTACAACCACGCGGACGCCGAGTTTCAGGAGAAGCTCGCCGACGACACATCCTTCGGGGGAAACCTCGGAAAGGCAGTCTTTCAGGCCAAGAAAAGGGTTCTCGAACCTTTTGGCCTGGTTGAAGAGGGTGCTAAGACGGCCCCTACCGGAAAGCGGATAGACGACCACTTTCCAAAAAGAAAGAAGGCTCGGACCGAAGAGGACTCCAAGCCTTCCACCTCGTCAGACGCCGAAGCTGGACCCAGCGGATCCCAGCAGCTGCAAATCCCAGCCCAACCAGCCTCAAGTTTGGGAGCTGATACAATGTCTGCGGGAGGTGGCGGCCCATTGGGCGACAATAACCAAGGTGCCGATGGAGTGGGCAATGCCTCGGGAGATTGGCATTGCGATTCCACGTGGATGGGGGACAGAGTCGTCACCAAGTCCACCCGAACCTGGGTGCTGCCCAGCTACAACAACCACCAGTACCGAGAGATCAAAAGCGGCTCCGTCGACGGAAGCAACGCCAACGCCTACTTTGGATACAGCACCCCCTGGGGGTACTTTGACTTTAACCGCTTCCACAGCCACTGGAGCCCCCGAGACTGGCAAAGACTCATCAACAACTACTGGGGCTTCAGACCCCGGTCCCTCAGAGTCAAAATCTTCAACATTCAAGTCAAAGAGGTCACGGTGCAGGACTCCACCACCACCATCGCCAACAACCTCACCTCCACCGTCCAAGTGTTTACGGACGACGACTACCAGCTGCCCTACGTCGTCGGCAACGGGACCGAGGGATGCCTGCCGGCCTTCCCTCCGCAGGTCTTTACGCTGCCGCAGTACGGTTACGCGACGCTGAACCGCGACAACACAGAAAATCCCACCGAGAGGAGCAGCTTCTTCTGCCTAGAGTACTTTCCCAGCAAGATGCTGAGAACGGGCAACAACTTTGAGTTTACCTACAACTTTGAGGAGGTGCCCTTCCACTCCAGCTTCGCTCCCAGTCAGAACCTGTTCAAGCTGGCCAACCCGCTGGTGGACCAGTACTTGTACCGCTTCGTGAGCACAAATAACACTGGCGGAGTCCAGTTCAACAAGAACCTGGCCGGGAGATACGCCAACACCTACAAAAACTGGTTCCCGGGGCCCATGGGCCGAACCCAGGGCTGGAACCTGGGCTCCGGGGTCAACCGCGCCAGTGTCAGCGCCTTCGCCACGACCAATAGGATGGAGCTCGAGGGCGCGAGTTACCAGGTGCCCCCGCAGCCGAACGGCATGACCAACAACCTCCAGGGCAGCAACACCTATGCCCTGGAGAACACTATGATCTTCAACAGCCAGCCGGCGAACCCGGGCACCACCGCCACGTACCTCGAGGGCAACATGCTCATCACCAGCGAGAGCGAGACGCAGCCGGTGAACCGCGTGGCGTACAACGTCGGCGGGCAGATGGCCACCAACAACCAGAGCTCCACCACTGCCCCCGCGACCGGCACGTACAACCTCCAGGAAATCGTGCCCGGCAGCGTGTGGATGGAGAGGGACGTGTACCTCCAAGGACCCATCTGGGCCAAGATCCCAGAGACGGGGGCGCACTTTCACCCCTCTCCGGCCATGGGCGGATTCGGACTCAAACACCCACCGCCCATGATGCTCATCAAGAACACGCCTGTGCCCGGAAATATCACCAGCTTCTCGGACGTGCCCGTCAGCAGCTTCATCACCCAGTACAGCACCGGGCAGGT
CACCGTGGAGATGGAGTGGGAGCTCAAGAAGGAAAACTCCAAGAGGTGGAACCCAGAGATCCAGTACACAAACAACTACAACGACCCCCAGTTTGTGGACTTTGCCCCGGACAGCACCGGGGAATACAGAACCACCAGACCTATCGGAACCCGATACCTTACCCGACCCCTTTAA

AAV-6 Full Genome: AF028704

Rep78: (SEQ ID NO:15)

ATGCCGGGGTTTTACGAGATTGTGATTAAGGTCCCCAGCGACCTTGACGAGCATCTGCCCGGCATTTCTGACAGCTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGTTGCCGCCAGATTCTGACATGGATCTGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTCCTGGTCCAGTGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTTCAGTTCGAGAAGGGCGAGTCCTACTTCCACCTCCATATTCTGGTGGAGACCACGGGGGTCAAATCCATGGTGCTGGGCCGCTTCCTGAGTCAGATTAGGGACAAGCTGGTGCAGACCATCTACCGCGGGATCGAGCCGACCCTGCCCAACTGGTTCGCGGTGACCAAGACGCGTAATGGCGCCGGAGGGGGGAACAAGGTGGTGGACGAGTGCTACATCCCCAACTACCTCCTGCCCAAGACTCAGCCCGAGCTGCAGTGGGCGTGGACTAACATGGAGGAGTATATAAGCGCGTGTTTAAACCTGGCCGAGCGCAAACGGCTCGTGGCGCACGACCTGACCCACGTCAGCCAGACCCAGGAGCAGAACAAGGAGAATCTGAACCCCAATTCTGACGCGCCTGTCATCCGGTCAAAAACCTCCGCACGCTACATGGAGCTGGTCGGGTGGCTGGTGGACCGGGGCATCACCTCCGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCCCAGATCAAGGCCGCTCTGGACAATGCCGGCAAGATCATGGCGCTGACCAAATCCGCGCCCGACTACCTGGTAGGCCCCGCTCCGCCCGCCGACATTAAAACCAACCGCATTTACCGCATCCTGGAGCTGAACGGCTACGACCCTGCCTACGCCGGCTCCGTCTTTCTCGGCTGGGCCCAGAAAAGGTTCGGAAAACGCAACACCATCTGGCTGTTTGGGCCGGCCACCACGGGCAAGACCAACATCGCGGAAGCCATCGCCCACGCCGTGCCCTTCTACGGCTGCGTCAACTGGACCAATGAGAACTTTCCCTTCAACGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTGGAGTCCGCCAAGGCCATTCTCGGCGGCAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCGTCCGCCCAGATCGATCCCACCCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCGTTGCAGGACCGGATGTTCAAATTTGAACTCACCCGCCGTCTGGAGCATGACTTTGGCAAGGTGACAAAGCAGGAAGTCAAAGAGTTCTTCCGCTGGGCGCAGGATCACGTGACCGAGGTGGCGCATGAGTTCTACGTCAGAAAGGGTGGAGCCAACAAGAGACCCGCCCCCGATGACGCGGATAAAAGCGAGCCCAAGCGGGCCTGCCCCTCAGTCGCGGATCCATCGACGTCAGACGCGGAAGGAGCTCCGGTGGACTTTGCCGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGCATGCTTCAGATGCTGTTTCCCTGCAAAACATGCGAGAGAATGAATCAGAATTTCAACATTTGCTTCACGCACGGGACCAGAGACTGTTCAGAATGTTTCCCCGGCGTGTCAGAATCTCAACCGGTCGTCAGAAAGAGGACGTATCGGAAACTCTGTGCCATTCATCATCTGCTGGGGCGGGCTCCCGAGATTGCTTGCTCGGCCTGCGATCTGGTCAACGTGGATCTGGATGACTGTGTTTC TGAGCAATAA

CapVP1: (SEQ ID NO:16)

ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCGCGAGTGGTGGGACTTGAAACCTGGAGCCCCGAAACCCAAAGCCAACCAGCAAAAGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGATGCAGCGGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGAGGGTTCTCGAACCTTTTGGTCTGGTTGAGGAAGGTGCTAAGACGGCTCCTGGAAAGAAACGTCCGGTAGAGCAGTCGCCACAAGAGCCAGACTCCTCCTCGGGCATTGGCAAGACAGGCCAGCAGCCCGCTAAAAAGAGACTCAATTTTGGTCAGACTGGCGACTCAGAGTCAGTCCCCGACCCACAACCTCTCGGAGAACCTCCAGCAACCCCCGCTGCTGTGGGACCTACTACAATGGCTTCAGGCGGTGGCGCACCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACATGGGCCTTGCCCACCTATAACAACCACCTCTACAAGCAAATCTCCAGTGCTTCAACGGGGGCCAGCAACGACAACCACTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGATTTCAACAGATTCCACTGCCATTTCTCACCACGTGACTGGCAGCGACTCATCAACAACAATTGGGGATTCCGGCCCAAGAGACTCAACTTCAAGCTCTTCAACATCCAAGTCAAGGAGGTCACGACGAATGATGGCGTCACGACCATCGCTAATAACCTTACCAGCACGGTTCAAGTCTTCTCGGACTCGGAGTACCAGTTGCCGTACGTCCTCGGCTCTGCGCACCAGGGCTGCCTCCCTCCGTTCCCGGCGGACGTGTTCATGATTCCGCAGTACGGCTACCTAACGCTCAACAATGGCAGCCAGGCAGTGGGACGGTCATCCTTTTACTGCCTGGAATATTTCCCATCGCAGATGCTGAGAACGGGCAATAACTTTACCTTCAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCGCACAGCCAGAGCCTGGACCGGCTGATGAATCCTCTCATCGACCAGTACCTGTATTACCTGAACAGAACTCAGAATCAGTCCGGAAGTGCCCAAAACAAGGACTTGCTGTTTAGCCGGGGGTCTCCAGCTGGCATGTCTGTTCAGCCCAAAAACTGGCTACCTGGACCCTGTTACCGGCAGCAGCGCGTTTCTAAAACAAAAACAGACAACAACAACAGCAACTTTACCTGGACTGGTGCTTCAAAATATAACCTTAATGGGCGTGAATCTATAATCAACCCTGGCACTGCTATGGCCTCACACAAAGACGACAAAGACAAGTTCTTTCCCATGAGCGGTGTCATGATTTTTGGAAAGGAGAGCGCCGGAGCTTCAAACACTGCATTGGACAATGTCATGATCACAGACGAAGAGGAAATCAAAGCCACTAACCCCGTGGCCACCGAAAGATTTGGGACTGTGGCAGTCAATCTCCAGAGCAGCAGCACAGACCCTGCGACCGGAGATGTGCATGTTATGGGAGCCTTACCTGGAATGGTGTGGCAAGACAGAGACGTATACCTGCAGGGTCCTATTTGGGCCAAAATTCCTCACACGGATGGACACTTTCACCCGTCTCCTCTCATGGGCGGCTTTGGACTTAAGCACCCGCCTCCTCAGATCCTCATCAAAAACACGCCTGTTCCTGCGAATCCTCCGGCAGAGTTTTCGGCTACAAAGTT
TGCTTCATTCATCACCCAGTATTCCACAGGACAAGTGAGCGTGGAGATTGAATGGGAGCTGCAGAAAGAAAACAGCAAACGCTGGAATCCCGAAGTGCAGTATACATCTAACTATGCAAAATCTGCCAACGTTGATTTCACTGTGGACAACAATGGACTTTATACTGAGCCTCGCCCCATTGGCACCCGTTACCTCACCCGTCCCCTGTAA

AAV-7 Full Genome: NC_006260

Rep78: (SEQ ID NO:17)

ATGCCGGGTTTCTACGAGATCGTGATCAAGGTGCCGAGCGACCTGGACGAGCACCTGCCGGGCATTTCTGACTCGTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGCTGCCCCCGGATTCTGACATGGATCTGAATCTGATCGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTCCTGGTCCAATGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTGTTCTTTGTTCAGTTCGAGAAGGGCGAGAGCTACTTCCACCTTCACGTTCTGGTGGAGACCACGGGGGTCAAGTCCATGGTGCTAGGCCGCTTCCTGAGTCAGATTCGGGAGAAGCTGGTCCAGACCATCTACCGCGGGGTCGAGCCCACGCTGCCCAACTGGTTCGCGGTGACCAAGACGCGTAATGGCGCCGGCGGGGGGAACAAGGTGGTGGACGAGTGCTACATCCCCAACTACCTCCTGCCCAAGACCCAGCCCGAGCTGCAGTGGGCGTGGACTAACATGGAGGAGTATATAAGCGCGTGTTTGAACCTGGCCGAACGCAAACGGCTCGTGGCGCAGCACCTGACCCACGTCAGCCAGACGCAGGAGCAGAACAAGGAGAATCTGAACCCCAATTCTGACGCGCCCGTGATCAGGTCAAAAACCTCCGCGCGCTACATGGAGCTGGTCGGGTGGCTGGTGGACCGGGGCATCACCTCCGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCCCAGATCAAGGCCGCGCTGGACAATGCCGGCAAGATCATGGCGCTGACCAAATCCGCGCCCGACTACCTGGTGGGGCCCTCGCTGCCCGCGGACATTAAAACCAACCGCATCTACCGCATCCTGGAGCTGAACGGGTACGATCCTGCCTACGCCGGCTCCGTCTTTCTCGGCTGGGCCCAGAAAAAGTTCGGGAAGCGCAACACCATCTGGCTGTTTGGGCCCGCCACCACCGGCAAGACCAACATTGCGGAAGCCATCGCCCACGCCGTGCCCTTCTACGGCTGCGTCAACTGGACCAATGAGAACTTTCCCTTCAACGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTGGAGTCCGCCAAGGCCATTCTCGGCGGCAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCGTCCGCCCAGATCGACCCCACCCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCGTTGCAGGACCGGATGTTCAAATTTGAACTCACCCGCCGTCTGGAGCACGACTTTGGCAAGGTGACGAAGCAGGAAGTCAAAGAGTTCTTCCGCTGGGCCAGTGATCACGTGACCGAGGTGGCGCATGAGTTCTACGTCAGAAAGGGCGGAGCCAGCAAAAGACCCGCCCCCGATGACGCGGATATAAGCGAGCCCAAGCGGGCCTGCCCCTCAGTCGCGGATCCATCGACGTCAGACGCGGAAGGAGCTCCGGTGGACTTTGCCGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGCATGATTCAGATGCTGTTTCCCTGCAAAACGTGCGAGAGAATGAATCAGAATTTCAACATTTGCTTCACACACGGGGTCAGAGACTGTTTAGAGTGTTTCCCCGGCGTGTCAGAATCTCAACCGGTCGTCAGAAAAAAGACGTATCGGAAACTCTGCGCGATTCATCATCTGCTGGGGCGGGCGCCCGAGATTGCTTGCTCGGCCTGCGACCTGGTCAACGTGGACCTGGACGACTGCGTTTCTGAGCAATAA

CapVP1: (SEQ ID NO:18)

ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCGCGAGTGGTGGGACCTGAAACCTGGAGCCCCGAAACCCAAAGCCAACCAGCAAAAGCAGGACAACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCATTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCTAAGACGGCTCCTGCAAAGAAGAGACCGGTAGAGCCGTCACCTCAGCGTTCCCCCGACTCCTCCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCCAGAAAGAGACTCAATTTCGGTCAGACTGGCGACTCAGAGTCAGTCCCCGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGCCCTCTAGTGTGGGATCTGGTACAGTGGCTGCAGGCGGTGGCGCACCAATGGCAGACAATAACGAAGGTGCCGACGGAGTGGGTAATGCCTCAGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATTACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAGCAAATCTCCAGTGAAACTGCAGGTAGTACCAACGACAACACCTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTCTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAAGCTGCGGTTCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGACGAATGACGGCGTTACGACCATCGCTAATAACCTTACCAGCACGATTCAGGTATTCTCGGACTCGGAATACCAGCTGCCGTACGTCCTCGGCTCTGCGCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTCTTCATGATTCCTCAGTACGGCTACCTGACTCTCAACAATGGCAGTCAGTCTGTGGGACGTTCCTCCTTCTACTGCCTGGAGTACTTCCCCTCTCAGATGCTGAGAACGGGCAACAACTTTGAGTTCAGCTACAGCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCCTGGACCGGCTGATGAATCCCCTCATCGACCAGTACTTGTACTACCTGGCCAGAACACAGAGTAACCCAGGAGGCACAGCTGGCAATCGGGAACTGCAGTTTTACCAGGGCGGGCCTTCAACTATGGCCGAACAAGCCAAGAATTGGTTACCTGGACCTTGCTTCCGGCAACAAAGAGTCTCCAAAACGCTGGATCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGAACGGCAGAAACTCGTTGGTTAATCCCGGCGTCGCCATGGCAACTCACAAGGACGACGAGGACCGCTTTTTCCCATCCAGCGGAGTCCTGATTTTTGGAAAAACTGGAGCAACTAACAAAACTACATTGGAAAATGTGTTAATGACAAATGAAGAAGAAATTCGTCCTACTAATCCTGTAGCCACGGAAGAATACGGGATAGTCAGCAGCAACTTACAAGCGGCTAATACTGCAGCCCAGACACAAGTTGTCAACAACCAGGGAGCCTTACCTGGCATGGTCTGGCAGAACCGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGATGGCAACTTTCACCCGTCTCCTTTGATGGGCGGCTTTGGACTTAAACATCCGCCTCCTCAGATCCTGATCAAGAACACTCCCGTTCCCGCTAATCCTCCGGAGGTGTTTACTCCTGCCAA
GTTTGCTTCGTTCATCACACAGTACAGCACCGGACAAGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGAAAACAGCAAGCGCTGGAACCCGGAGATTCAGTACACCTCCAACTTTGAAAAGCAGACTGGTGTGGACTTTGCCGTTGACAGCCAGGGTGTTTACTCTGAGCCTCGCCCTATTGGCACTCGTTACCTCACCCGTAATCTGTAA

AAV-8 Full Genome: NC_006261

Rep78: (SEQ ID NO:19)

ATGCCGGGCTTCTACGAGATCGTGATCAAGGTGCCGAGCGACCTGGACGAGCACCTGCCGGGCATTTCTGACTCGTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGCTGCCCCCGGATTCTGACATGGATCGGAATCTGATCGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTCCTGGTCCAATGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTTCAGTTCGAGAAGGGCGAGAGCTACTTTCACCTGCACGTTCTGGTCGAGACCACGGGGGTCAAGTCCATGGTGCTAGGCCGCTTCCTGAGTCAGATTCGGGAAAAGCTTGGTCCAGACCATCTACCCGCGGGGTCGAGCCCCACCTTGCCCAACTGGTTCGCGGTGACCAAAGACGCGGTAATGGCGCCGGCGGGGGGGAACAAGGTGGTGGACGAGTGCTACATCCCCAACTACCTCCTGCCCAAGACTCAGCCCGAGCTGCAGTGGGCGTGGACTAACATGGAGGAGTATATAAGCGCGTGCTTGAACCTGGCCGAGCGCAAACGGCTCGTGGCGCAGCACCTGACCCACGTCAGCCAGACGCAGGAGCAGAACAAGGAGAATCTGAACCCCAATTCTGACGCGCCCGTGATCAGGTCAAAAACCTCCGCGCGCTATATGGAGCTGGTCGGGTGGCTGGTGGACCGGGGCATCACCTCCGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCCCAGATCAAGGCCGCGCTGGACAATGCCGGCAAGATCATGGCGCTGACCAAATCCGCGCCCGACTACCTGGTGGGGCCCTCGCTGCCCGCGGACATTACCCAGAACCGCATCTACCGCATCCTCGCTCTCAACGGCTACGACCCTGCCTACGCCGGCTCCGTCTTTCTCGGCTGGGCTCAGAAAAAGTTCGGGAAACGCAACACCATCTGGCTGTTTGGACCCGCCACCACCGGCAAGACCAACATTGCGGAAGCCATCGCCCACGCCGTGCCCTTCTACGGCTGCGTCAACTGGACCAATGAGAACTTTCCCTTCAATGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTGGAGTCCGCCAAGGCCATTCTCGGCGGCAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCGTCCGCCCAGATCGACCCCACCCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCTCTCCAGGACCGGATGTTTAAGTTCGAACTCACCCGCCGTCTGGAGCACGACTTTGGCAAGGTGACAAAGCAGGAAGTCAAAGAGTTCTTCCGCTGGGCCAGTGATCACGTGACCGAGGTGGCGCATGAGTTTTACGTCAGAAAGGGCGGAGCCAGCAAAAGACCCGCCCCCGATGACGCGGATAAAAGCGAGCCCAAGCGGGCCTGCCCCTCAGTCGCGGATCCATCGACGTCAGACGCGGAAGGAGCTCCGGTGGACTTTGCCGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGCATGCTTCAGATGCTGTTTCCCTGCAAAACGTGCGAGAGAATGAATCAGAATTTCAACATTTGCTTCACACACGGGGTCAGAGACTGCTCAGAGTGTTTCCCCGGCGTGTCAGAATCTCAACCGGTCGTCAGAAAGAGGACGTATCGGAAACTCTGTGCGATTCATCATCTGCTGGGGCGGGCTCCCGAGATTGCTTGCTCGGCCTGCGATCTGGTCAACGTGGACCTGGATGACTGTGTTTCTGAGCAATAA

CapVP1: (SEQ ID NO:20)

ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCGCGAGTGGTGGGCGCTGAAACCTGGAGCCCCGAAGCCCAAAGCCAACCAGCAAAAGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTGCAGGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGGCGCTAAGACGGCTCCTGGAAAGAAGAGACCGGTAGAGCCATCACCCCAGCGTTCTCCAGACTCCTCTACGGGCATCGGCAAGAAAGGCCAACAGCCCGCCAGAAAAAGACTCAATTTTGGTCAGACTGGCGACTCAGAGTCAGTTCCAGACCCTCAACCTCTCGGAGAACCTCCAGCAGCGCCCTCTGGTGTGGGACCTAATACAATGGCTGCAGGCGGTGGCGCACCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAGCAAATCTCCAACGGGACATCGGGAGGAGCCACCAACGACAACACCTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTTAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCCAAGAGACTCAGCTTCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATCGCCAATAACCTCACCAGCACCATCCAGGTGTTTACGGACTCGGAGTACCAGCTGCCGTACGTTCTCGGCTCTGCCCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGACGTGTTCATGATTCCCCAGTACGGCTACCTAACACTCAACAACGGTAGTCAGGCCGTGGGACGCTCCTCCTTCTACTGCCTGGAATACTTTCCTTCGCAGATGCTGAGAACCGGCAACAACTTCCAGTTTACTTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCCCACAGCCAGAGCTTGGACCGGCTGATGAATCCTCTGATTGACCAGTACCTGTACTACTTGTCTCGGACTCAAACAACAGGAGGCACGGCAAATACGCAGACTCTGGGCTTCAGCCAAGGTGGGCCTAATACAATGGCCAATCAGGCAAAGAACTGGCTGCCAGGACCCTGTTACCGCCAACAACGCGTCTCAACGACAACCGGGCAAAACAACAATAGCAACTTTGCCTGGACTGCTGGGACCAAATACCATCTGAATGGAAGAAATTCATTGGCTAATCCTGGCATCGCTATGGCAACACACAAAGACGACGAGGAGCGTTTTTTTCCCAGTAACGGGATCCTGATTTTTGGCAAACAAAATGCTGCCAGAGACAATGCGGATTACAGCGATGTCATGCTCACCAGCGAGGAAGAAATCAAAACCACTAACCCTGTGGCTACAGAGGAATACGGTATCGTGGCAGATAACTTGCAGCAGCAAAACACGGCTCCTCAAATTGGAACTGTCAACAGCCAGGGGGCCTTACCCGGTATGGTCTGGCAGAACCGGGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTCCACCCGTCTCCGCTGATGGGCGGCTTTGGCCTGAAACATCCTCCGCCTCAGATCCTGATCAAGAACACGCCTGTACCTGCGGATCCTCCGACCACCTTCAACCAGTC
AAAGCTGAACTCTTTCATCACGCAATACAGCACCGGACAGGTCAGCGTGGAAATTGAATGGGAGCTGCAGAAGGAAAACAGCAAGCGCTGGAACCCCGAGATCCAGTACACCTCCAACTACTACAAATCTACAAGTGTGGACTTTGCTGTTAATACAGAAGGCGTGTACTCTGAACCCCGCCCCATTGGCACCCGTTACCTCACCCGTAATCTGTAA

AAV-9 Cap Only: AY530579

CapVP1: (SEQ ID NO:21)

ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACAACCTTAGTGAAGGAATTCGCGAGTGGTGGGCTTTGAAACCTGGAGCCCCTCAACCCAAGGCAAATCAACAACATCAAGACAACGCTCGAGGTCTTGTGCTTCCGGGTTACAAATACCTTGGACCCGGCAACGGACTCGACAAGGGGGAGCCGGTCAACGCAGCAGACGCGGCGGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTCAAGGCCGGAGACAACCCGTACCTCAAGTACAACCACGCCGACGCCGAGTTCCAGGAGCGGCTCAAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAAAAGAGGCTTCTTGAACCTCTTGGTCTGGTTGAGGAAGCGGCTAAGACGGCTCCTGGAAAGAAGAGGCCTGTAGAGCAGTCTCCTCAGGAACCGGACTCCTCCGCGGGTATTGGCAAATCGGGTGCACAGCCCGCTAAAAAGAGACTCAATTTCGGTCAGACTGGCGACACAGAGTCAGTCCCAGACCCTCAACCAATCGGAGAACCTCCCGCAGCCCCCTCAGGTGTGGGATCTCTTACAATGGCTTCAGGTGGTGGCGCACCAGTGGCAGACAATAACGAAGGTGCCGATGGAGTGGGTAGTTCCTCGGGAAATTGGCATTGCGATTCCCAATGGCTGGGGGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAATCACCTCTACAAGCAAATCTCCAACAGCACATCTGGAGGATCTTCAAATGACAACGCCTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCTAAGCGACTCAACTTCAAGCTCTTCAACATTCAGGTCAAAGAGGTTACGGACAACAATGGAGTCAAGACCATCGCCAATAACCTTACCAGCACGGTCCAGGTCTTCACGGACTCAGACTATCAGCTCCCGTACGTGCTCGGGTCGGCTCACGAGGGCTGCCTCCCGCCGTTCCCAGCGGACGTTTTCATGATTCCTCAGTACGGGTATCTGACGCTTAATGATGGAAGCCAGGCCGTGGGTCGTTCGTCCTTTTACTGCCTGGAATATTTCCCGTCGCAAATGCTAAGAACGGGTAACAACTTCCAGTTCAGCTACGAGTTTGAGAACGTACCTTTCCATAGCAGCTACGCTCACAGCCAAAGCCTGGACCGACTAATGAATCCACTCATCGACCAATACTTGTACTATCTCTCAAAGACTATTAACGGTTCTGGACAGAATCAACAAACGCTAAAATTCAGTGTGGCCGGACCCAGCAACATGGCTGTCCAGGGAAGAAACTACATACCTGGACCCAGCTACCGACAACAACGTGTCTCAACCACTGTGACTCAAAACAACAACAGCGAATTTGCTTGGCCTGGAGCTTCTTCTTGGGCTCTCAATGGACGTAATAGCTTGATGAATCCTGGACCTGCTATGGCCAGCCACAAAGAAGGAGAGGACCGTTTCTTTCCTTTGTCTGGATCTTTAATTTTTGGCAAACAAGGAACTGGAAGAGACAACGTGGATGCGGACAAAGTCATGATAACCAACGAAGAAGAAATTAAAACTACTAACCCGGTAGCAACGGAGTCCTATGGACAAGTGGCCACAAACCACCAGAGTGCCCAAGCACAGGCGCAGACCGGCTGGGTTCAAAACCAAGGAATACTTCCGGGTATGGTTTGGCAGGACAGAGATGTGTACCTGCAAGGACCCATTTGGGCCAAAATTCCTCACACGGACGGCAACTTTCACCCTTCTCCGCTGATGGGAGGGTTTGGAATGAAGCACCCGCCTCCTCAGATCCTCATCAAAAACACACCTGTACCTGCGGATCCTCCAACGGCCTTCAACAAGGACAAGCT
GAACTCTTTCATCACCCAGTATTCTACTGGCCAAGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGGAAAACAGCAAGCGCTGGAACCCGGAGATCCAGTACACTTCCAACTATTACAAGTCTAATAATGTTGAATTTGCTGTTAATACTGAAGGTGTATATAGTGAACCCCGCCCCATTGGCACCAGATACCTGACTCGTAATCTGTAA

AAV-10 Partial Genome: AY631965

Rep78: (SEQ ID NO:22)

ATGCCGGGCTTCTACGAGATCGTGATCAAGGTGCCGAGCGACCTGGACGAGCACCTGCCGGGCATTTCTGACTCGTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGCTGCCCCCGGATTCTGACATGGATCGGAATCTGATCGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTCCTGGTCCACTGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTTCAGTTCGAGAAGGGCGAGTCCTACTTTCACCTGCACGTTCTGGTCGAGACCACGGGGGTCAAGTCCATGGTCCTGGGCCGCTTCCTGAGTCAGATCAGAGACAGGCTGGTGCAGACCATCTACCGCGGGGTAGAGCCCACGCTGCCCAACTGGTTCGCGGTGACCAAGACGCGAAATGGCGCCGGCGGGGGGAACAAGGTGGTGGACGAGTGCTACATCCCCAACTACCTCCTGCCCAAGACGCAGCCCGAGCTGCAGTGGGCGTGGACTAACATGGAGGAGTATATAAGCGCGTGTCTGAACCTCGCGGAGCGTAAACGGCTCGTGGCGCAGCACCTGACCCACGTCAGCCAGACGCAGGAGCAGAACAAGGAGAATCTGAACCCGAATTCTGACGCGCCCGTGATCAGGTCAAAAACCTCCGCGCGCTACATGGAGCTGGTCGGGTGGCTGGTGGACCGGGGCATCACCTCCGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCCCAGATCAAGGCCGCGCTGGACAATGCCGGAAAGATCATGGCGCTGACCAAATCCGCGCCCGACTACCTGGTAGGCCCGTCCTTACCCGCGGACATTAAGGCCAACCGCATCTACCGCATCCTGGAGCTCAACGGCTACGACCCCGCCTACGCCGGCTCCGTCTTCCTGGGCTGGGCGCAGAAAAAGTTCGGTAAAAGGAATACAATTTGGCTGTTCGGGCCCGCCACCACCGGCAAGACCAACATCGCGGAAGCCATCGCCCACGCCGTGCCCTTCTACGGCTGCGTCAACTGGACCAATGAGAACTTTCCCTTCAACGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACCGCCAAGGTCGTGGAGTCCGCCAAGGCCATTCTGGGCGGAAGCAAGGTGCGCGTCGACCAAAAGTGCAAGTCCTCGGCCCAGATCGACCCCACGCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATCGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCCCTGCAGGACCGCATGTTCAAGTTCGAGCTCACCCGCCGTCTGGAGCACGACTTTGGCAAGGTGACCAAGCAGGAAGTCAAAGAGTTCTTCCGCTGGGCTCAGGATCACGTGACTGAGGTGACGCATGAGTTCTACGTCAGAAAGGGCGGAGCCACCAAAAGACCCGCCCCCAGTGACGCGGATATAAGCGAGCCCAAGCGGGCCTGCCCCTCAGTTGCGGAGCCATCGACGTCAGACGCGGAAGCACCGGTGGACTTTGCGGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGCATGCTTCAGATGCTGTTTCCCTGCAAGACATGCGAGAGAATGAATCAGAATTTCAACGTCTGCTTCACGCACGGGGTCAGAGACTGCTCAGAGTGCTTCCCCGGCGCGTCAGAATCTCAACCTGTCGTCAGAAAAAAGACGTATCAGAAACTGTGCGCGATTCATCATCTGCTGGGGCGGGCACCCGAGATTGCGTGTTCGGCCTGCGATCTCGTCAACGTGGACTTGGATGA CTGTGTTTCTGAGCAATAA

CapVP1: (SEQ ID NO:23)

ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCGCGAGTGGTGGGACCTGAAACCTGGAGCCCCCAAGCCCAAGGCCAACCAGCAGAAGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGCGGGTTCTCGAACCTCTCGGTCTGGTTGAGGAAGCTGCTAAGACGGCTCCTGGAAAGAAGAGACCGGTAGAACCGTCACCTCAGCGTTCCCCCGACTCCTCCACGGGCATCGGCAAGAAAGGCCAGCAGCCCGCTAAAAAGAGACTGAACTTTGGGCAGACTGGCGAGTCAGAGTCAGTCCCCGACCCTCAACCAATCGGAGAACCACCAGCAGGCCCCTCTGGTCTGGGATCTGGTACAATGGCTGCAGGCGGTGGCGCTCCAATGGCAGACAATAACGAAGGCGCCGACGGAGTGGGTAGTTCCTCAGGAAATTGGCATTGCGATTCCACATGGCTGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAGCAAATCTCCAACGGGACATCGGGAGGAAGCACCAACGACAACACCTACTTCGGCTACAGCACCCCCTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTCTCACCACGTGACTGGCAGCGACTCATCAACAACAACTGGGGATTCCGGCCAAAAAGACTCAGCTTCAAGCTCTTCAACATCCAGGTCAAGGAGGTCACGCAGAATGAAGGCACCAAGACCATCGCCAATAACCTTACCAGCACGATTCAGGTATTTACGGACTCGGAATACCAGCTGCCGTACGTCCTCGGCTCCGCGCACCAGGGCTGCCTGCCTCCGTTCCCGGCGGATGTCTTCATGATTCCCCAGTACGGCTACCTGACACTGAACAATGGAAGTCAAGCCGTAGGCCGTTCCTCCTTCTACTGCCTGGAATATTTTCCATCTCAAATGCTGCGAACTGGAAACAATTTTGAATTCAGCTACACCTTCGAGGACGTGCCTTTCCACAGCAGCTACGCACACAGCCAGAGCTTGGACCGACTGATGAATCCTCTCATTGACCAGTACCTGTACTACTTATCCAGAACTCAGTCCACAGGAGGAACTCAAGGTACCCAGCAATTGTTATTTTCTCAAGCTGGGCCTGCAAACATGTCGGCTCAGGCCAAGAACTGGCTGCCTGGACCTTGCTACCGGCAGCAGCGAGTCTCCACGACACTGTCGCAAAACAACAACAGCAACTTTGCTTGGACTGGTGCCACCAAATATCACCTGAACGGAAGAGACTCTCTGGTGAATCCCGGTGTCGCCATGGCAACCCACAAGGACGACGAGGAACGCTTCTTCCCGTCGAGCGGAGTCCTGATGTTTGGAAAACAGGGTGCTGGAAGAGACAATGTGGACTACAGCAGCGTTATGCTAACAAGCGAAGAAGAAATTAAAACCACTAACCCTGTAGCCACAGAACAATACGGCGTGGTGGCTGACAACTTGCAGCAAGCCAATACAGGGCCTATTGTGGGAAATGTCAACAGCCAAGGAGCCTTACCTGGCATGGTCTGGCAGAACCGAGACGTGTACCTGCAGGGTCCCATCTGGGCCAAGATTCCTCACACGGACGGCAACTTTCACCCGTCTCCTCTGATGGGCGGCTTTGGACTTAAACACCCGCCTCCACAGATCCTGATCAAGAACACGCCGGTACCTGCGGATCCTCCAACAACGTTCAGCCAGGC
GAAATTGGCTTCCTTCATCACGCAGTACAGCACCGGACAGGTCAGCGTGGAAATCGAGTGGGAGCTGCAGAAGGAGAACAGCAAACGCTGGAACCCAGAGATTCAGTACACTTCAAACTACTACAAATCTACAAATGTGGACTTTGCTGTCAATACAGAGGGAACTTATTCTGAGCCTCGCCCCATTGGTACTCGTTATCTGACACGTAATCTGTAA

AAV-11 Partial Genome: AY631966

Rep78: (SEQ ID NO:24)

ATGCCGGGCTTCTACGAGATCGTGATCAAGGTGCCGAGCGACCTGGACGAGCACCTGCCGGGCATTTCTGACTCGTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGCTGCCCCCGGATTCTGACATGGATCGGAATCTGATCGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTCCTGGTCCACTGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTTCAGTTCGAGAAGGGCGAGTCCTACTTCCACCTCCACGTTCTCGTCGAGACCACGGGGGTCAAGTCCATGGTCCTGGGCCGCTTCCTGAGTCAGATCAGAGACAGGCTGGTGCAGACCATCTACCGCGGGGTCGAGCCCACGCTGCCCAACTGGTTCGCGGTGACCAAGACGCGAAATGGCGCCGGCGGGGGGAACAAGGTGGTGGACGAGTGCTACATCCCCAACTACCTCCTGCCCAAGACCCAGCCCGAGCTGCAGTGGGCGTGGACTAACATGGAGGAGTATATAAGCGCGTGTCTAAACCTCGCGGAGCGTAAACGGCTCGTGGCGCAGCACCTGACCCACGTCAGCCAGACGCAGGAGCAGAACAAGGAGAATCTGAACCCGAATTCTGACGCGCCCGTGATCAGGTCAAAAACCTCCGCGCGCTACATGGAGCTGGTCGGGTGGCTGGTGGACCGGGGCATCACCTCCGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCCCAGATCAAGGCCGCGCTGGACAATGCCGGAAAGATCATGGCGCTGACCAAATCCGCGCCCGACTACCTGGTAGGCCCGTCCTTACCCGCGGACATTAAGGCCAACCGCATCTACCGCATCCTGGAGCTCAACGGCTACGACCCCGCCTACGCCGGCTCCGTCTTCCTGGGCTGGGCGCAGAAAAAGTTCGGTAAACGCAACACCATCTGGCTGTTTGGGCCCGCCACCACCGGCAAGACCAACATCGCGGAAGCCATAGCCCACGCCGTGCCCTTCTACGGCTGCGTGAACTGGACCAATGAGAACTTTCCCTTCAACGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACCGCCAAGGTCGTGGAGTCCGCCAAGGCCATTCTGGGCGGAAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCCTCGGCCCAGATCGACCCCACGCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATCGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCGCTGCAGGACCGCATGTTCAAGTTCGAGCTCACCCGCCGTCTGGAGCACGACTTTGGCAAGGTGACCAAGCAGGAAGTCAAAGAGTTCTTCCGCTGGGCTCAGGATCACGTGACTGAGGTGGCGCATGAGTTCTACGTCAGAAAGGGCGGAGCCACCAAAAGACCCGCCCCCAGTGACGCGGATATAAGCGAGCCCAAGCGGGCCTGCCCCTCAGTTCCGGAGCCATCGACGTCAGACGCGGAAGCACCGGTGGACTTTGCGGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGCATGCTTCAGATGCTGTTTCCCTGCAAGACATGCGAGAGAATGAATCAGAATTTCAACGTCTGCTTCACGCACGGGGTCAGAGACTGCTCAGAGTGCTTCCCCGGCGCGTCAGAATCTCAACCCGTCGTCAGAAAAAAGACGTATCAGAAACTGTGCGCGATTCATCATCTGCTGGGGCGGGCACCCGAGATTGCGTGTTCGGCCTGCGATCTCGTCAACGTGGACTTGGATGA CTGTGTTTCTGAGCAATAA

CapVP1: (SEQ ID NO:25)

ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAGGGCATTCGCGAGTGGTGGGACCTGAAACCTGGAGCCCCGAAGCCCAAGGCCAACCAGCAGAAGCAGGACGACGGCCGGGGTCTGGTGCTTCCTGGCTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGGGAGCCCGTCAACGCGGCGGACGCAGCGGCCCTCGAGCACGACAAGGCCTACGACCAGCAGCTCAAAGCGGGTGACAATCCGTACCTGCGGTATAACCACGCCGACGCCGAGTTTCAGGAGCGTCTGCAAGAAGATACGTCTTTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAGAAGAGGGTACTCGAACCTCTGGGCCTGGTTGAAGAAGGTGCTAAAACGGCTCCTGGAAAGAAGAGACCGTTAGAGTCACCACAAGAGCCCGACTCCTCCTCGGGCATCGGCAAAAAAGGCAAACAACCAGCCAGAAAGAGGCTCAACTTTGAAGAGGACACTGGAGCCGGAGACGGACCCCCTGAAGGATCAGATACCAGCGCCATGTCTTCAGACATTGAAATGCGTGCAGCACCGGGCGGAAATGCTGTCGATGCGGGACAAGGTTCCGATGGAGTGGGTAATGCCTCGGGTGATTGGCATTGCGATTCCACCTGGTCTGAGGGCAAGGTCACAACAACCTCGACCAGAACCTGGGTCTTGCCCACCTACAACAACCACTTGTACCTGCGTCTCGGAACAACATCAAGCAGCAACACCTACAACGGATTCTCCACCCCCTGGGGATATTTTGACTTCAACAGATTCCACTGTCACTTCTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGACTACGACCAAAAGCCATGCGCGTTAAAATCTTCAATATCCAAGTTAAGGAGGTCACAACGTCGAACGGCGAGACTACGGTCGCTAATAACCTTACCAGCACGGTTCAGATATTTGCGGACTCGTCGTATGAGCTCCCGTACGTGATGGACGCTGGACAAGAGGGGAGCCTGCCTCCTTTCCCCAATGACGTGTTCATGGTGCCTCAATATGGCTACTGTGGCATCGTGACTGGCGAGAATCAGAACCAAACGGACAGAAACGCTTTCTACTGCCTGGAGTATTTTCCTTCGCAAATGTTGAGAACTGGCAACAACTTTGAAATGGCTTACAACTTTGAGAAGGTGCCGTTCCACTCAATGTATGCTCACAGCCAGAGCCTGGACAGACTGATGAATCCCCTCCTGGACCAGTACCTGTGGCACTTACAGTCGACTACCTCTGGAGAGACTCTGAATCAAGGCAATGCAGCAACCACATTTGGAAAAATCAGGAGTGGAGACTTTGCCTTTTACAGAAAGAACTGGCTGCCTGGGCCTTGTGTTAAACAGCAGAGATTCTCAAAAACTGCCAGTCAAAATTACAAGATTCCTGCCAGCGGGGGCAACGCTCTGTTAAAGTATGACACCCACTATACCTTAAACAACCGCTGGAGCAACATCGCGCCCGGACCTCCAATGGCCACAGCCGGACCTTCGGATGGGGACTTCAGTAACGCCCAGCTTATATTCCCTGGACCATCTGTTACCGGAAATACAACAACTTCAGCCAACAATCTGTTGTTTACATCAGAAGAAGAAATTGCTGCCACCAACCCAAGAGACACGGACATGTTTGGCCAGATTGCTGACAATAATCAGAATGCTACAACTGCTCCCATAACCGGCAACGTGACTGCTATGGGAGTGCTGCCTGGCATGGTGTGGCAAAACAGAGACATTTACTACCAAGGGCCAATTTGGGCCAAGATCCCACACGCGGACGGACATTTTCATCCTTCACCGCTGATTGGTGGGTTTGGACTGAAACACCCGCCTCCCCAGATATTCATCAAGAACACTCCCGTACCTGCCAATCCTGCGACAACCTTCACTGCAGCCAGAGTGGACTCTTT
CATCACACAATACAGCACCGGCCAGGTCGCTGTTCAGATTGAATGGGAAATTGAAAAGGAACGCTCCAAACGCTGGAATCCTGAAGTGCAGTTTACTTCAAACTATGGGAACCAGTCTTCTATGTTGTGGGCTCCTGATACAACTGGGAAGTATACAGAGCCGCGGGTTATTGGCTCTCGTTATTTGACTAATCATTTGTAA

AAV-12 Partial Genome: DQ813647

Rep78: (SEQ ID NO:26)

ATGCCGGGGTTCTACGAGGTGGTGATCAAGGTGCCCAGCGACCTGGACGAGCACCTGCCCGGCATTTCTGACTCCTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGTTGCCCCCGGATTCTGACATGGATCAGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGAGTTCCTGGTGGAATGGCGCCGAGTGAGTAAATTTCTGGAGGCCAAGTTTTTTGTGCAGTTTGAAAAGGGGGACTCGTACTTTCATTTGCATATTCTGATTGAAATTACCGGCGTGAAATCCATGGTGGTGGGCCGCTACGTGAGTCAGATTAGGGATAAACTGATCCAGCGCATCTACCGCGGGGTCGAGCCCCAGCTGCCCAACTGGTTCGCGGTCACAAAGACCCGAAATGGCGCCGGAGGCGGGAACAAGGTGGTGGACGAGTGCTACATCCCCAACTACCTGCTCCCCAAGGTCCAGCCCGAGCTTCAGTGGGCGTGGACTAACATGGAGGAGTATATAAGCGCCTGTTTGAACCTCGCGGAGCGTAAACGGCTCGTGGCGCAGCACCTGACGCACGTCTCCCAGACCCAGGAGGGCGACAAGGAGAATCTGAACCCGAATTCTGACGCGCCGGTGATCCGGTCAAAAACCTCCGCCAGGTACATGGAGCTGGTCGGGTGGCTGGTGGACAAGGGCATCACGTCCGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCGTACATCTCCTTCAACGCGGCCTCCAACTCCCGGTCGCAGATCAAGGCGGCCCTGGACAATGCCTCCAAAATCATGAGCCTCACCAAAACGGCTCCGGACTATCTCATCGGGCAGCAGCCCGTGGGGGACATTACCACCAACCGGATCTACAAAATCCTGGAACTGAACGGGTACGACCCCCAGTACGCCGCCTCCGTCTTTCTCGGCTGGGCCCAGAAAAAGTTTGGAAAGCGCAACACCATCTGGCTGTTTGGGCCCGCCACCACCGGCAAGACCAACATCGCGGAAGCCATCGCCCACGCGGTCCCCTTCTACGGCTGCGTCAACTGGACCAATGAGAACTTTCCCTTCAACGACTGCGTCGACAAAATGGTGATTTGGTGGGAGGAGGGCAAGATGACCGCCAAGGTCGTAGAGTCCGCCAAGGCCATTCTGGGCGGCAGCAAGGTGCGCGTGGACCAAAAATGCAAGGCCTCTGCGCAGATCGACCCCACCCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACAGCACCACCTTCGAGCACCAGCAGCCCCTGCAGGACCGGATGTTCAAGTTTGAACTCACCCGCCGCCTCGACCACGACTTTGGCAAGGTCACCAAGCAGGAAGTCAAGGACTTTTTCCGGTGGGCGGCTGATCACGTGACTGACGTGGCTCATGAGTTTTACGTCACAAAGGGTGGAGCTAAGAAAAGGCCCGCCCCCTCTGACGAGGATATAAGCGAGCCCAAGCGGCCGCGCGTGTCATTTGCGCAGCCGGAGACGTCAGACGCGGAAGCTCCCGGAGACTTCGCCGACAGGTACCAAAACAAATGTTCTCGTCACGCGGGTATGCTGCAGATGCTCTTTCCCTGCAAGACGTGCGAGAGAATGAATCAGAATTCCAACGTCTGCTTCACGCACGGTCAGAAAGATTGCGGGGAGTGCTTTCCCGGGTCAGAATCTCAACCGGTTTCTGTCGTCAGAAAAACGTATCAGAAACTGTGCATCCTTCATCAGCTCCGGGGGGCACCCGAGATCGCCTGCTCTGCTTGCGACCAACTCAACCCCGATTTGGACGATTG CCAATTTGAGCAATAA

CapVP1: (SEQ ID NO:27)

ATGGCTGCTGACGGTTATCTTCCAGATTGGCTCGAGGACAACCTCTCTGAAGGCATTCGCGAGTGGTGGGCGCTGAAACCTGGAGCTCCACAACCCAAGGCCAACCAACAGCATCAGGACAACGGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAGGCCTACGACAAGCAGCTCGAGCAGGGGGACAACCCGTATCTCAAGTACAACCACGCCGACGCCGAGTTCCAGCAGCGCTTGGCGACCGACACCTCTTTTGGGGGCAACCTCGGGCGAGCAGTCTTCCAGGCCAAAAAGAGGATTCTCGAGCCTCTGGGTCTGGTTGAAGAGGGCGTTAAAACGGCTCCTGGAAAGAAACGCCCATTAGAAAAGACTCCAAATCGGCCGACCAACCCGGACTCTGGGAAGGCCCCGGCCAAGAAAAAGCAAAAAGACGGCGAACCAGCCGACTCTGCTAGAAGGACACTCGACTTTGAAGACTCTGGAGCAGGAGACGGACCCCCTGAGGGATCATCTTCCGGAGAAATGTCTCATGATGCTGAGATGCGTGCGGCGCCAGGCGGAAATGCTGTCGAGGCGGGACAAGGTGCCGATGGAGTGGGTAATGCCTCCGGTGATTGGCATTGCGATTCCACCTGGTCAGAGGGCCGAGTCACCACCACCAGCACCCGAACCTGGGTCCTACCCACGTACAACAACCACCTGTACCTGCGAATCGGAACAACGGCCAACAGCAACACCTACAACGGATTCTCCACCCCCTGGGGATACTTTGACTTTAACCGCTTCCACTGCCACTTTTCCCCACGCGACTGGCAGCGACTCATCAACAACAACTGGGGACTCAGGCCGAAATCGATGCGTGTTAAAATCTTCAACATACAGGTCAAGGAGGTCACGACGTCAAACGGCGAGACTACGGTCGCTAATAACCTTACCAGCACGGTTCAGATCTTTGCGGATTCGACGTATGAACTCCCATACGTGATGGACGCCGGTCAGGAGGGGAGCTTTCCTCCGTTTCCCAACGACGTCTTTATGGTTCCCCAATACGGATACTGCGGAGTTGTCACTGGAAAAAACCAGAACCAGACAGACAGAAATGCCTTTTACTGCCTGGAATACTTTCCATCCCAAATGCTAAGAACTGGCAACAATTTTGAAGTCAGTTACCAATTTGAAAAAGTTCCTTTCCATTCAATGTACGCGCACAGCCAGAGCCTGGACAGAATGATGAATCCTTTACTGGATCAGTACCTGTGGCATCTGCAATCGACCACTACCGGAAATTCCCTTAATCAAGGAACAGCTACCACCACGTACGGGAAAATTACCACTGGAGACTTTGCCTACTACAGGAAAAACTGGTTGCCTGGAGCCTGCATTAAACAACAAAAATTTTCAAAGAATGCCAATCAAAACTACAAGATTCCCGCCAGCGGGGGAGACGCCCTTTTAAAGTATGACACGCATACCACTCTAAATGGGCGATGGAGTAACATGGCTCCTGGACCTCCAATGGCAACCGCAGGTGCCGGGGACTCGGATTTTAGCAACAGCCAGCTGATCTTTGCCGGACCCAATCCGAGCGGTAACACGACCACATCTTCAAACAATTTGTTGTTTACCTCAGAAGAGGAGATTGCCACAACAAACCCACGAGACACGGACATGTTTGGACAGATTGCAGATAATAATCAAAATGCCACCACCGCCCCTCACATCGCTAACCTGGACGCTATGGGAATTGTTCCCGGAATGGTCTGGCAAAACAGAGACATCTACTACCAGGGCCCTATTTGGGCCAAGGTCCCTCACACGGACGGACACTTTCACCCTTCGCCGCTGATGGGAGGATTTGGACTGAAACACCCGCCTCCACAGATTTTCATCAAAAACACCCCCGTACCCGCCAATCCCAATACTAC
CTTTAGCGCTGCAAGGATTAATTCTTTTCTGACGCAGTACAGCACCGGACAAGTTGCCGTTCAGATCGACTGGGAAATTCAGAAGGAGCATTCCAAACGCTGGAATCCCGAAGTTCAATTTACTTCAAACTACGGCACTCAAAATTCTATGCTGTGGGCTCCCGACAATGCTGGCAACTACCACGAACTCCGGGCTATTGGGTCCCGTTTCCTCACCCACCACTTGTAA

AAV-13 Partial Genome: EU285562

Rep78: (SEQ ID NO:28)

ATGCCGGGATTCTACGAGATTGTCCTGAAGGTGCCCAGCGACCTGGACGAGCACCTGCCTGGCATTTCTGACTCTTTTGTAAACTGGGTGGCGGAGAAGGAATGGGAGCTGCCGCCGGATTCTGACATGGATCTGAATCTGATTGAGCAGGCACCCCTAACCGTGGCCGAAAAGCTGCAACGCGAATTCCTGGTCGAGTGGCGCCGCGTGAGTAAGGCCCCGGAGGCCCTCTTCTTTGTTCAGTTCGAGAAGGGGGACAGCTACTTCCACCTACACATTCTGGTGGAGACCGTGGGCGTGAAATCCATGGTGGTGGGCCGCTACGTGAGCCAGATTAAAGAGAAGCTGGTGACCCGCATCTACCGCGGGGTCGAGCCGCAGCTTCCGAACTGGTTCGCGGTGACCAAGACGCGTAATGGCGCCGGAGGCGGGAACAAGGTGGTGGACGACTGCTACATCCCCAACTACCTGCTCCCCAAGACCCAGCCCGAGCTCCAGTGGGCGTGGACTAATATGGACCAGTATTTAAGCGCCTGTTTGAATCTCGCGGAGCGTAAACGGCTGGTGGCGCAGCATCTGACGCACGTGTCGCAGACGCAGGAGCAGAACAAAGAGAACCAGAATCCCAATTCTGACGCGCCGGTGATCAGATCAAAAACCTCCGCGAGGTACATGGAGCTGGTCGGGTGGCTGGTGGACCGCGGGATCACGTCAGAAAAGCAATGGATCCAGGAGGACCAGGCCTCTTACATCTCCTTCAACGCCGCCTCCAACTCGCGGTCACAAATCAAGGCCGCACTGGACAATGCCTCCAAATTTATGAGCCTGACAAAAACGGCTCCGGACTACCTGGTGGGAAACAACCCGCCGGAGGACATTACCAGCAACCGGATCTACAAAATCCTCGAGATGAACGGGTACGATCCGCAGTACGCGGCCTCCGTCTTCCTGGGCTGGGCGCAAAAGAAGTTCGGGAAGAGGAACACCATCTGGCTCTTTGGGCCGGCCACGACGGGTAAAACCAACATCGCTGAAGCTATCGCCCACGCCGTGCCCTTTTACGGCTGCGTGAACTGGACCAATGAGAACTTTCCGTTCAACGATTGCGTCGACAAGATGGTGATCTGGTGGGAGGAGGGCAAGATGACGGCCAAGGTCGTGGAGTCCGCCAAGGCCATTCTGGGCGGAAGCAAGGTGCGCGTGGACCAAAAGTGCAAGTCATCGGCCCAGATCGACCCAACTCCCGTCATCGTCACCTCCAACACCAACATGTGCGCGGTCATCGACGGAAATTCCACCACCTTCGAGCACCAACAACCACTCCAAGACCGGATGTTCAAGTTCGAGCTCACCAAGCGCCTGGAGCACGACTTTGGCAAGGTCACCAAGCAGGAAGTCAAGGACTTTTTCCGGTGGGCGTCAGATCACGTGACTGAGGTGTCTCACGAGTTTTACGTCAGAAAGGGTGGAGCTAGAAAGAGGCCCGCCCCCAATGACGCAGATATAAGTGAGCCCAAGCGGGCCTGTCCGTCAGTTGCGCAGCCATCGACGTCAGACGCGGAAGCTCCGGTGGACTACGCGGACAGGTACCAAAACAAATGTTCTCGTCACGTGGGCATGAATCTGATGCTTTTTCCCTGCCGGCAATGCGAGAGAATGAATCAGAATGTGGACATTTGCTTCACGCACGGGGTCATGGACTGTGCCGAGTGCTTCCCCGTGTCAGAATCTCAACCCGTGTCTGTCGTCAGAAAGCGGACATATCAGAAACTGTGTCCGATTCATCACATCATGGGGAGGGCGCCCGAGGTGGCTTGTTCGGCCTGCGATCTGGCCAATGTGGACTTGGATGACTGTGACATGGAGCAATAA

CapVP1: (SEQ ID NO:29)

ATGACTGACGGTTACCTTCCAGATTGGCTAGAGGACAACCTCTCTGAAGGCGTTCGAGAGTGGTGGGCGCTGCAACCTGGAGCCCCTAAACCCAAGGCAAATCAACAACATCAGGACAACGCTCGGGGTCTTGTGCTTCCGGGTTACAAATACCTCGGACCCGGCAACGGACTTGACAAGGGGGAACCCGTCAACGCAGCGGACGCGGCAGCCCTCGAACACGACAAGGCCTACGACCAGCAGCTCAAGGCCGGTGACAACCCCTACCTCAAGTACAACCACGCCGACGCCGAGTTTCAGGAGCGTCTTCAAGAAGATACGTCTTTTGGGGGCAACCTCGGACGAGCAGTCTTCCAGGCCAAAAAGAGGATCCTTGAGCCTCTGGGTCTGGTTGAGGAAGCGGCTAAGACGGCTCCTGGAAAAAAGAGACCTGTAGAGCAATCTCCAGCAGAACCGGACTCCTCTTCGGGCATCGGCAAATCAGGCCAGCAGCCCGCTAGAAAAAGACTGAATTTTGGTCAGACTGGCGACACAGAGTCAGTCCCAGACCCTCAACCACTCGGACAACCTCCCGCAGCCCCCTCTGGTGTGGGATCTACTACAATGGCTTCAGGCGGTGGCGCACCAATGGCAGACAATAACGAGGGTGCCGATGGAGTGGGTAATTCCTCAGGAAATTGGCATTGCGATTCCCAATGGCTGGGCGACAGAGTCATCACCACCAGCACCCGCACCTGGGCCCTGCCCACCTACAACAATCACCTCTACAAGCAAATCTCCAGCCAATCAGGAGCCACCAACGACAACCACTACTTTGGCTACAGCACCCCCTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGACCCAAGAGACTCAACTTCAAGCTCTTTAACATTCAAGTCAAAGAGGTCACGCAGAATGACGGTACGACGACGATTGCCAATAACCTTACCAGCACGGTTCAGGTGTTTACTGACTCCGAGTACCAGCTCCCGTACGTCCTCGGCTCGGCGCATCAGGGATGCCTCCCGCCGTTCCCAGCAGACGTCTTCATGGTCCCACAGTATGGATACCTCACCCTGAACAACGGGAGTCAGGCGGTAGGACGCTCTTCCTTTTACTGCCTGGAGTACTTTCCTTCTCAGATGCTGCGTACTGGAAACAACTTTCAGTTTAGCTACACTTTTGAAGACGTGCCTTTCCACAGCAGCTACGCTCACAGCCAAAGTCTGGACCGTCTCATGAATCCTCTGATCGACCAGTACCTGTACTATCTGAACAGGACACAAACAGCCAGTGGAACTCAGCAGTCTCGGCTACTGTTTAGCCAAGCTGGACCCACCAGTATGTCTCTTCAAGCTAAAAACTGGCTGCCTGGACCTTGCTACAGACAGCAGCGTCTGTCAAAGCAGGCAAACGACAACAACAACAGCAACTTTCCCTGGACTGGTGCCACCAAATATCATCTGAATGGCCGGGACTCATTGGTGAACCCGGGCCCTGCTATGGCCAGTCACAAGGATGACAAAGAAAAGTTTTTCCCCATGCATGGAACCCTGATATTTGGTAAAGAAGGAACAAATGCCAACAACGCGGATTTGGAAAATGTCATGATTACAGATGAAGAAGAAATCCGCACCACCAATCCCGTGGCTACGGAGCAGTACGGGACTGTGTCAAATAATTTGCAAAACTCAAACGCTGGTCCAACTACTGGAACTGTCAATCACCAAGGAGCGTTACCTGGTATGGTGTGGCAGGATCGAGACGTGTACCTGCAGGGACCCATTTGGGCCAAGATTCCTCACACCGATGGACACTTTCATCCTTCTCCACTGATGGGAGGTTTTGGGCTCAAACACCCGCCTCCTCAGATCATGATCAAAAACACTCCCGTTCCAGCCAATCCTCCCACAAACTTTAGTGCGGCAAAGTTTGCTTCCTT
CATCACACAGTACTCCACGGGGCAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGGAGAACAGCAAACGCTGGAATCCCGAAATTCAGTACACTTCCAACTACAACAAATCTGTTAATGTGGACTTTACTGTGGACACTAATGGTGTGTATTCAGAGCCTCGCCCCATTGGCACCAGATACCTGACTCGTAATCTGTAA

ITR Sequence (SEQ ID NO:30)

CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAGCCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCT

Rep2 Sequence - contains Rep78 and Rep52 (start codon underlined) (SEQ ID NO:31)

ATGCCGGGGTTTTACGAGATTGTGATTAAGGTCCCCAGCGACCTTGACGAGCATCTGCCCGGCATTTCTGACAGCTTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGTTGCCGCCAGATTCTGACATGGATCTGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGACTTTCTGACGGAATGGCGCCGTGTGAGTAAGGCCCCGGAGGCCCTTTTCTTTGTGCAATTTGAGAAGGGAGAGAGCTACTTCCACATGCACGTGCTCGTGGAAACCACCGGGGTGAAATCCATGGTTTTGGGACGTTTCCTGAGTCAGATTCGCGAAAAACTGATTCAGAGAATTTACCGCGGGATCGAGCCGACTTTGCCAAACTGGTTCGCGGTCACAAAGACCAGAAATGGCGCCGGAGGCGGGAACAAGGTGGTGGATGAGTGCTACATCCCCAATTACTTGCTCCCCAAAACCCAGCCTGAGCTCCAGTGGGCGTGGACTAATATGGAACAGTATTTAAGCGCCTGTTTGAATCTCACGGAGCGTAAACGGTTGGTGGCGCAGCATCTGACGCACGTGTCGCAGACGCAGGAGCAGAACAAAGAGAATCAGAATCCCAATTCTGATGCGCCGGTGATCAGATCAAAAACTTCAGCCAGGTACATGGAGCTGGTCGGGTGGCTCGTGGACAAGGGGATTACCTCGGAGAAGCAGTGGATCCAGGAGGACCAGGCCTCATACATCTCCTTCAATGCGGCCTCCAACTCGCGGTCCCAAATCAAGGCTGCCTTGGACAATGCGGGAAAGATTATGAGCCTGACTAAAACCGCCCCCGACTACCTGGTGGGCCAGCAGCCCGTGGAGGACATTTCCAGCAATCGGATTTATAAAATTTTGGAACTAAACGGGTACGATCCCCAATATGCGGCTTCCGTCTTTCTGGGATGGGCCACGAAAAAGTTCGGCAAGAGGAACACCATCTGGCTGTTTGGGCCTGCAACTACCGGGAAGACCAACATCGCGGAGGCCATAGCCCACACTGTGCCCTTCTACGGGTGCGTAAACTGGACCAATGAGAACTTTCCCTTCAACGACTGTGTCGACAAGATGGTGATCTGGTGGGAGGAGGGGAAGATGACCGCCAAGGTCGTGGAGTCGGCCAAAGCCATTCTCGGAGGAAGCAAGGTGCGCGTGGACCAGAAATGCAAGTCCTCGGCCCAGATAGACCCGACTCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATTGACGGGAACTCAACGACCTTCGAACACCAGCAGCCGTTGCAAGACCGGATGTTCAAATTTGAACTCACCCGCCGTCTGGATCATGACTTTGGGAAGGTCACCAAGCAGGAAGTCAAAGACTTTTTCCGGTGGGCAAAGGATCACGTGGTTGAGGTGGAGCATGAATTCTACGTCAAAAAGGGTGGAGCCAAGAAAAGACCCGCCCCCAGTGACGCAGATATAAGTGAGCCCAAACGGGTGCGCGAGTCAGTTGCGCAGCCATCGACGTCAGACGCGGAAGCTTCGATCAACTACGCAGACAGGTACCAAAACAAATGTTCTCGTCACGTGGGCATGAATCTGATGCTGTTTCCCTGCAGACAATGCGAGAGAATGAATCAGAATTCAAATATCTGCTTCACTCACGGACAGAAAGACTGTTTAGAGTGCTTTCCCGTGTCAGAATCTCAACCCGTTTCTGTCGTCAAAAAGGCGTATCAGAAACTGTGCTACATTCATCATATCATGGGAAAGGTGCCAGACGCTTGCACTGCCTGCGATCTGGTCAATGTGGATTTGGATGACTG CATCTTTGAACAATAA

Cap2 Sequence - contains sequentially VP1, VP2, AAP, VP3 (start codons underlined) (SEQ ID NO:32)

ATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGAAGGAATAAGACAGTGGTGGAAGCTCAAACCTGGCCCACCACCACCAAAGCCCGCAGAGCGGCATAAGGACGACAGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCGGACCCTTCAACGGACTCGACAAGGGAGAGCCGGTCAACGAGGCAGACGCCGCGGCCCTCGAGCACGACAAAGCCTACGACCGGCAGCTCGACAGCGGAGACAACCCGTACCTCAAGTACAACCACGCCGACGCGGAGTTTCAGGAGCGCCTTAAAGAAGATACGTCTTTTGGGGGCAACCTCGGACGAGCAGTCTTCCAGGCGAAAAAGAGGGTTCTTGAACCTCTGGGCCTGGTTGAGGAACCTGTTAAGACGGCTCCGGGAAAAAAGAGGCCGGTAGAGCACTCTCCTGTGGAGCCAGACTCCTCCTCGGGAACCGGAAAGGCGGGCCAGCAGCCTGCAAGAAAAAGATTGAATTTTGGTCAGACTGGAGACGCAGACTCAGTACCTGACCCCCAGCCTCTCGGACAGCCACCAGCAGCCCCCTCTGGTCTGGGAACTAATACGATGGCTACAGGCAGTGGCGCACCAATGGCAGACAATAACGAGGGCGCCGACGGAGTGGGTAATTCCTCGGGAAATTGGCATTGCGATTCCACATGGATGGGCGACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCACCTCTACAAACAAATTTCCAGCCAATCAGGAGCCTCGAACGACAATCACTACTTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACTTTTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGACCCAAGAGACTCAACTTCAAGCTCTTTAACATTCAAGTCAAAGAGGTCACGCAGAATGACGGTACGACGACGATTGCCAATAACCTTACCAGCACGGTTCAGGTGTTTACTGACTCGGAGTACCAGCTCCCGTACGTCCTCGGCTCGGCGCATCAAGGATGCCTCCCGCCGTTCCCAGCAGACGTCTTCATGGTGCCACAGTATGGATACCTCACCCTGAACAACGGGAGTCAGGCAGTAGGACGCTCTTCATTTTACTGCCTGGAGTACTTTCCTTCTCAGATGCTGCGTACCGGAAACAACTTTACCTTCAGCTACACTTTTGAGGACGTTCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTCTGGACCGTCTCATGAATCCTCTCATCGACCAGTACCTGTATTACTTGAGCAGAACAAACACTCCAAGTGGAACCACCACGCAGTCAAGGCTTCAGTTTTCTCAGGCCGGAGCGAGTGACATTCGGGACCAGTCTAGGAACTGGCTTCCTGGACCCTGTTACCGCCAGCAGCGAGTATCAAAGACATCTGCGGATAACAACAACAGTGAATACTCGTGGACTGGAGCTACCAAGTACCACCTCAATGGCAGAGACTCTCTGGTGAATCCGGGCCCGGCCATGGCAAGCCACAAGGACGATGAAGAAAAGTTTTTTCCTCAGAGCGGGGTTCTCATCTTTGGGAAGCAAGGCTCAGAGAAAACAAATGTGGACATTGAAAAGGTCATGATTACAGACGAAGAGGAAATCAGGACAACCAATCCCGTGGCTACGGAGCAGTATGGTTCTGTATCTACCAACCTCCAGAGAGGCAACAGACAAGCAGCTACCGCAGATGTCAACACACAAGGCGTTCTTCCAGGCATGGTCTGGCAGGACAGAGATGTGTACCTTCAGGGGCCCATCTGGGCAAAGATTCCACACACGGACGGACATTTTCACCCCTCTCCCCTCATGGGTGGATTCGGACTTAAACACCCTCCTCCACAGATTCTCATCAAGAACACCCCGGTACCTGCGAATCCTTCGACCACCTTCAGTGCGGCAAAGTTTGC
TTCCTTCATCACACAGTACTCCACGGGACAGGTCAGCGTGGAGATCGAGTGGGAGCTGCAGAAGGAAAACAGCAAACGCTGGAATCCCGAAATTCAGTACACTTCCAACTACAACAAGTCTGTTAATGTGGACTTTACTGTGGACACTAATGGCGTGTATTCAGAGCCTCGCCCCATTGGCACCAGATACCTGACTCGTAATCTGTAA

Cap5 Sequence - contains sequentially VP1, VP2, AAP, VP3 (start codons underlined) (SEQ ID NO:33)

ATGGCTTTTGTTGATCACCCTCCAGATTGGTTGGAAGAAGTTGGTGAAGGTCTTCGCGAGTTTTTGGGCCTTGAAGCGGGCCCACCGAAACCAAAACCCAATCAGCAGCATCAAGATCAAGCCCGTGGTCTTGTGCTGCCTGGTTATAACTATCTCGGACCCGGAAACGGTCTCGATCGAGGAGAGCCTGTCAACAGGGCAGACGAGGTCGCGCGAGAGCACGACATCTCGTACAACGAGCAGCTTGAGGCGGGAGACAACCCCTACCTCAAGTACAACCACGCGGACGCCGAGTTTCAGGAGAAGCTCGCCGACGACACATCCTTCGGGGGAAACCTCGGAAAGGCAGTCTTTCAGGCCAAGAAAAGGGTTCTCGAACCTTTTGGCCTGGTTGAAGAGGGTGCTAAGACGGCCCCTACCGGAAAGCGGATAGACGACCACTTTCCAAAAAGAAAGAAGGCTCGGACCGAAGAGGACTCCAAGCCTTCCACCTCGTCAGACGCCGAAGCTGGACCCAGCGGATCCCAGCAGCTGCAAATCCCAGCCCAACCAGCCTCAAGTTTGGGAGCTGATACAATGTCTGCGGGAGGTGGCGGCCCATTGGGCGACAATAACCAAGGTGCCGATGGAGTGGGCAATGCCTCGGGAGATTGGCATTGCGATTCCACGTGGATGGGGGACAGAGTCGTCACCAAGTCCACCCGAACCTGGGTGCTGCCCAGCTACAACAACCACCAGTACCGAGAGATCAAAAGCGGCTCCGTCGACGGAAGCAACGCCAACGCCTACTTTGGATACAGCACCCCCTGGGGGTACTTTGACTTTAACCGCTTCCACAGCCACTGGAGCCCCCGAGACTGGCAAAGACTCATCAACAACTACTGGGGCTTCAGACCCCGGTCCCTCAGAGTCAAAATCTTCAACATTCAAGTCAAAGAGGTCACGGTGCAGGACTCCACCACCACCATCGCCAACAACCTCACCTCCACCGTCCAAGTGTTTACGGACGACGACTACCAGCTGCCCTACGTCGTCGGCAACGGGACCGAGGGATGCCTGCCGGCCTTCCCTCCGCAGGTCTTTACGCTGCCGCAGTACGGTTACGCGACGCTGAACCGCGACAACACAGAAAATCCCACCGAGAGGAGCAGCTTCTTCTGCCTAGAGTACTTTCCCAGCAAGATGCTGAGAACGGGCAACAACTTTGAGTTTACCTACAACTTTGAGGAGGTGCCCTTCCACTCCAGCTTCGCTCCCAGTCAGAACCTCTTCAAGCTGGCCAACCCGCTGGTGGACCAGTACTTGTACCGCTTCGTGAGCACAAATAACACTGGCGGAGTCCAGTTCAACAAGAACCTGGCCGGGAGATACGCCAACACCTACAAAAACTGGTTCCCGGGGCCCATGGGCCGAACCCAGGGCTGGAACCTGGGCTCCGGGGTCAACCGCGCCAGTGTCAGCGCCTTCGCCACGACCAATAGGATGGAGCTCGAGGGCGCGAGTTACCAGGTGCCCCCGCAGCCGAACGGCATGACCAACAACCTCCAGGGCAGCAACACCTATGCCCTGGAGAACACTATGATCTTCAACAGCCAGCCGGCGAACCCGGGCACCACCGCCACGTACCTCGAGGGCAACATGCTCATCACCAGCGAGAGCGAGACGCAGCCGGTGAACCGCGTGGCGTACAACGTCGGCGGGCAGATGGCCACCAACAACCAGAGCTCCACCACTGCCCCCGCGACCGGCACGTACAACCTCCAGGAAATCGTGCCCGGCAGCGTGTGGATGGAGAGGGACGTGTACCTCCAAGGACCCATCTGGGCCAAGATCCCAGAGACGGGGGCGCACTTTCACCCCTCTCCGGCCATGGGCGGATTCGGACTCAAACACCCACCGCCCATGATGCTCATCAAGAACACGCCTGTGCCCGGAAATATCACCAGCTTCTCGGACGTGCCCGTCAGCAGCTTCATCACCCAGTACAGCACCGGGCAGGT
CACCGTGGAGATGGAGTGGGAGCTCAAGAAGGAAAACTCCAAGAGGTGGAACCCAGAGATCCAGTACACAAACAACTACAACGACCCCCAGTTTGTGGACTTTGCCCCGGACAGCACCGGGGAATACAGAAGCACCAGACCTATCGGAACCCGATACCTTACCCGACCCCTTTAA

Example 12 - Adenovirus Polynucleotide Sequences

Adenovirus (Ad) polynucleotides can be selected from any serotype, and representative polynucleotides are exemplified below.

E2A Full Sequence (SEQ ID NO:34)

CGACCGCACCCTGTGACGAAAGCCGCCCGCAAGCTGCGCCCCTGAGTTAGTCATCTGAACTTCGGCCTGGGCGTCTCTGGGAAGTACCACAGTGGTGGGAGCGGGACTTTCCTGGTACACCAGGGCAGCGGGCCAACTACGGGGATTAAGGTTATTACGAGGTGTGGTGGTAATAGCCGCCTGTTCGAGGAGAATTCGGTTTCGGTGGGCGCGGATTCCGTTGACCCGGGATATCATGTGGGGTCCCGCGCTCATGTAGTTTATTCGGGTTGAGTAGTCTTGGGCAGCTCCAGCCGCAAGTCCCATTTGTGGCTGGTAACTCCACATGTAGGGCGTGGGAATTTCCTTGCTCATAATGGCGCTGACGACAGGTGCTGGCGCCGGGTGTGGCCGCTGGAGATGACGTAGTTTTCGCGCTTAAATTTGAGAAAGGGCGCGAAACTAGTCCTTAAGAGTCAGCGCGCAGTATTTGCTGAAGAGAGCCTCCGCGTCTTCCAGCGTGCGCCGAAGCTGATCTTCGCTTTTGTGATACAGGCAGCTGCGGGTGAGGGAGCGCAGAGACCTGTTTTTTATTTTCAGCTCTTGTTCTTGGCCCCTGCTTTGTTGAAATATAGCATACAGAGTGGGAAAAATCCTATTTCTAAGCTCGCGGGTCGATACGGGTTCGTTGGGCGCCAGACGCAGCGCTCCTCCTCCTGCTGCTGCCGCCGCTGTGGATTTCTTGGGCTTTGTCAGAGTCTTGCTATCCGGTCGCCTTTGCTTCTGTGTGACCGCTGCTGTTGCTGCCGCTGCCGCTGCCGCCGGTGCAGTAGGGGCTGTAGAGATGACGGTAGTAATGCAGGATGTTACGGGGGAAGGCCACGCCGTGATGGTAGAGAAGAAAGCGGCGGGCGAAGGAGATGTTGCCCCCACAGTCTTGCAAGCAAGCAACTATGGCGTTCTTGTGCCCGCGCCACGAGCGGTAGCCTTGGCGCTGTTGTTGCTCTTGGGCTAACGGCGGCGGCTGCTTAGACTTACCGGCCCTGGTTCCAGTGGTGTCCCATCTACGGTTGGGTCGGCGAACAGGCAGTGCCGGCGGCGCCTGAGGAGCGGAGGTTGTAGCGATGCTGGGAACGGTTGCCAATTTCTGGGGCGCCGGCGAGGGGAATGCGACCGAGGGTGACGGTGTTTCGTCTGACACCTCTTCGGCCTCGGAAGCTTCGTCTAGGCTGTCCCAGTCTTCCATCATCTCCTCCTCCTCGTCCAAAACCTCCTCTGCCTGACTGTCCCAGTATTCCTCCTCGTCCGTGGGTGGCGGCGGCGGCAGCTGCAGCTTCTTTTTGGGTGCCATCCTGGGAAGCAAGGGCCCGCGGCTGCTGATAGGGCTGCGGCGGCGGGGGGATTGGGTTGAGCTCCTCGCCGGACTGGGGGTCCAGGTAAACCCCCCGTCCCTTTCGTAGCAGAAACTCTTGGCGGGCTTTGTTGATGGCTTGCAATTGGCCAAGGATGTGGCCCTGGGTAATGACGCAGGCGGTAAGCTCCGCATTTGGCGGGCGGGATTGGTCTTCGTAGAACCTAATCTCGTGGGCGTGGTAGTCCTCAGGTACAAATTTGCGAAGGTAAGCCGACGTCCACAGCCCCGGAGTGAGTTTCAACCCCGGAGCCGCGGACTTTTCGTCAGGCGAGGGACCCTGCAGCTCAAAGGTACCGATAATTTGACTTTCGCTAAGCAGTTGCGAATTGCAGACCAGGGAGCGGTGCGGGGTGCATAGGTTGCAGCGACAGTGACACTCCAGTAGGCCGTCACCGCTCACGTCTTCCATGATGTCGGAGTGGTAGGCAAGGTAGTTGGCTAGCTGCAGAAGGTAGCAGTGACCCCAAAGCGGCGGAGGGCATTCACGGTACTTAATGGGCACAAAGTCGCTAGGAAGCGCACAGCAGGTGGCGGGCAGAATTCCTGAACGCTCTAGGATAAAGTTCCTAAAGTTTTGCAACATGCTTTGACTGGTGAAGTCTGGCAG
ACCCTGTTGCAGGGTTTTAAGCAGGCGTTCGGGGAAGATAATGTCCGCCAGGTGCGCGGCCACGGAGCGCTCGTTGAAGGCCGTCCATAGGTCCTTCAAGTTTTGCTTTAGCAGCTTCTGCAGCTCCTTTAGGTTGCGCTCCTCCAGGCATTGCTGCCACACGCCCATGGCCGTTTGCCAGGTGTAGCACAGAAATAAGTAAACGCAGTCGCGGACGTAGTCGCGGCGCGCCTCGCCCTTGAGCGTGGAATGAAGCACGTTTTGCCCGAGGCGGTTTTCGTGCAAAATTCCAAGGTAGGAGACCAGGTTGCAGAGCTCCACGTTGGAAATTTTGCAGGCCTGGCGCACGTAGCCCTGGCGAAAGGTGTAGTGCAACGTTTCCTCTAGCTTGCGCTGCATCTCCGGGTCAGCAAAGAACCGCTGCATGCACTCAAGCTCCACGGTAACAAGCACTGCGGCCATCATTAGCTTGCGTCGCTCCTCCAAGTCGGCAGGCTCGCGCGTCTCAAGCCAGCGCGCCAGCTGCTCATCGCCAACTGCGGGTAGGCCCTCCTCGGTTTGTTCTTGCAAGTTTGCATCCCTCTCCAGGGGTCGTGCACGGCGCACGATCAGCTCGCTCATGACTGTGCTCATAACCTTGGGGGGTAGGTTAAGTGCCGGGTAGGCAAAGTGGGTGACCTCGATGCTGCGTTTCAGCACGGCTAGGCGCGCGTTGTCACCCTCAAGTTCCACCAGCACTCCACAGTGACTTTCATTTTCGCTGTTTTCTTGTTGCAGAGCGTTTGCCGCGCGTTTCTCGTCGCGTCCAAGACCCTCAAAGATTTTTGGCACTTCGTCGAGCGAGGCGATATCAGGTATGACAGCGCCCTGCCGCAAGGCCAGCTGCTTGTCCGCTCGGCTGCGGTTGGCACGGCAGGATAGGGGTATCTTGCAGTTTTGGAAAAAGATGTGATAGGTGGCAAGCACCTCTGGCACGGCAAATACGGGGTAGAAGTTGAGGCGCGGGTTGGGCTCGCATGTGCCGTTTTCTTGGCGTTTGGGGGGTACGCGCGGTGAGAACAGGTGGCGTTCGTAGGCAAGGCTGACATCCGCTATGGCGAGGGGCACATCGCTGCGCTCTTGCAACGCGTCGCAGATAATGGCGCACTGGCGCTGCAGATGCTTCAACAGCACGTCGTCTCCCACATCTAGGTAGTCGCCATGCCTTTGGTCCCCCCGCCCGACTTGTTCCTCGTTTGCCTCTGCGTCGTCCTGGTCTTGCTTTTTATCCTCTGTTGGTACTGAGCGATCCTCGTCGTCTTCGCTTACAAAACCTGGGTCCTGCTCGATAATCACTTCCTCCTCCTCAAGCGGGGGTGCCTCGACGGGGAAGGTGGTAGGCGCGTTGGCGGCATCGGTGGAGGCGGTGGTGGCGAACTCAAAGGGGGCGGTTAGGCTGTCCTCCTTCTCGACTGACTCCATGATCTTTTTCTGCCTATAGGAGAAGGAAATGGCCAGTCGGGAAGAGGAGCAGCGCGAAACCACCCCCGAGCGCGGACGCGGTGCGGCGCGACGTCCACCAACCATGGAGGACGTGTCGTCCCCGTCGCCGTCGCCGCCGCCTCCCCGCGCGCCCCCAAAAAAGCGGCTGAGGCGGCGTCTCGAGTCCGAGGACGAAGAAGACTCGTCACAAGATGCGCTGGTGCCGCGCACACCCAGCCCGCGGCCATCGACCTCGACGGCGGATTTGGCCATTGCGTCCAAAAAGAAAAAGAAGCGCCCCTCTCCCAAGCCCGAGCGCCCGCCATCCCCAGAGGTGATCGTGGACAGCGAGGAAGAAAGAGAAGATGTGGCGCTACAAATGGTGGGTTTCAGCAACCCACCGGTGCTAATCAAGCACGGCAAGGGAGGTAAGCGCACGGTGCGGCGGCTGAATGAAGACGACCCAGTGGCGCGGGGTATGCGGACGCAAGAGGAAAAGGAAGAGTCCAGTGAAGCGGAAAGTGAAAGCACGGTGATAA
ACCCGCTGAGCCTGCCGATCGTGTCTGCGTGGGAGAAGGGCATGGAGGCTGCGCGCGCGTTGATGGACAAGTACCACGTGGATAACGATCTAAAGGCAAACTTCAAGCTACTGCCTGACCAAGTGGAAGCTCTGGCGGCCGTATGCAAGACCTGGCTAAACGAGGAGCACCGCGGGTTGCAGCTGACCTTCACCAGCAACAAGACCTTTGTGACGATGATGGGGCGATTCCTGCAGGCGTACCTGCAGTCGTTTGCAGAGGTAACCTACAAGCACCACGAGCCCACGGGCTGCGCGTTGTGGCTGCACCGCTGCGCTGAGATCGAAGGCGAGCTTAAGTGTCTACACGGGAGCATTATGATAAATAAGGAGCACGTGATTGAAATGGATGTGACGAGCGAAAACGGGCAGCGCGCGCTGAAGGAGCAGTCTAGCAAGGCCAAGATCGTGAAGAACCGGTGGGGCCGAAATGTGGTGCAGATCTCCAACACCGACGCAAGGTGCTGCGTGCATGACGCGGCCTGTCCGGCCAATCAGTTTTCCGGCAAGTCTTGCGGCATGTTCTTCTCTGAAGGCGCAAAGGCTCAGGTGGCTTTTAAGCAGATCAAGGCTTTCATGCAGGCGCTGTATCCTAACGCCCAGACCGGGCACGGTCACCTTCTGATGCCACTACGGTGCGAGTGCAACTCAAAGCCTGGGCATGCACCCTTTTTGGGAAGGCAGCTACCAAAGTTGACTCCGTTCGCCCTGAGCAACGCGGAGGACCTGGACGCGGATCTGATCTCCGACAAGAGCGTGCTGGCCAGCGTGCACCACCCGGCGCTGATAGTGTTCCAGTGCTGCAACCCTGTGTATCGCAACTCGCGCGCGCAGGGCGGAGGCCCCAACTGCGACTTCAAGATATCGGCGCCCGACCTGCTAAACGCGTTGGTGATGGTGCGCAGCCTGTGGAGTGAAAACTTCACCGAGCTGCCGCGGATGGTTGTGCCTGAGTTTAAGTGGAGCACTAAACACCAGTATCGCAACGTGTCCCTGCCAGTGGCGCATAGCGATGCGCGGCAGAACCCCTTTGATTTTTAAACGGCGCAGACGGCAAGGGTGGGGGGTAAATAATCACCCGAGAGTGTACAAATAAAAACATTTGCCTTTATTGAAAGTGTCTCCTAGTACATTATTTTTACATGTTTTTCAAGTGACAAAAAGAAGTGGCGCTCCTAATCTGCGCACTGTGGCTGCGGAAGTAGGGCGAGTGGCGCTCCAGGAAGCTGTAGAGCTGTTCCTGGTTGCGACGCAGGGTGGGCTGTACCTGGGGACTGTTAAGCATGGAGTTGGGTACC

E2A ORF Sequence (SEQ ID NO:35)

ATGGCCAGTCGGGAAGAGGAGCAGCGCGAAACCACCCCCGAGCGCGGACGCGGTGCGGCGCGACGTCCACCAACCATGGAGGACGTGTCGTCCCCGTCGCCGTCGCCGCCGCCTCCCCGCGCGCCCCCAAAAAAGCGGCTGAGGCGGCGTCTCGAGTCCGAGGACGAAGAAGACTCGTCACAAGATGCGCTGGTGCCGCGCACACCCAGCCCGCGGCCATCGACCTCGACGGCGGATTTGGCCATTGCGTCCAAAAAGAAAAAGAAGCGCCCCTCTCCCAAGCCCGAGCGCCCGCCATCCCCAGAGGTGATCGTGGACAGCGAGGAAGAAAGAGAAGATGTGGCGCTACAAATGGTGGGTTTCAGCAACCCACCGGTGCTAATCAAGCACGGCAAGGGAGGTAAGCGCACGGTGCGGCGGCTGAATGAAGACGACCCAGTGGCGCGGGGTATGCGGACGCAAGAGGAAAAGGAAGAGTCCAGTGAAGCGGAAAGTGAAAGCACGGTGATAAACCCGCTGAGCCTGCCGATCGTGTCTGCGTGGGAGAAGGGCATGGAGGCTGCGCGCGCGTTGATGGACAAGTACCACGTGGATAACGATCTAAAGGCAAACTTCAAGCTACTGCCTGACCAAGTGGAAGCTCTGGCGGCCGTATGCAAGACCTGGCTAAACGAGGAGCACCGCGGGTTGCAGCTGACCTTCACCAGCAACAAGACCTTTGTGACGATGATGGGGCGATTCCTGCAGGCGTACCTGCAGTCGTTTGCAGAGGTAACCTACAAGCACCACGAGCCCACGGGCTGCGCGTTGTGGCTGCACCGCTGCGCTGAGATCGAAGGCGAGCTTAAGTGTCTACACGGGAGCATTATGATAAATAAGGAGCACGTGATTGAAATGGATGTGACGAGCGAAAACGGGCAGCGCGCGCTGAAGGAGCAGTCTAGCAAGGCCAAGATCGTGAAGAACCGGTGGGGCCGAAATGTGGTGCAGATCTCCAACACCGACGCAAGGTGCTGCGTGCATGACGCGGCCTGTCCGGCCAATCAGTTTTCCGGCAAGTCTTGCGGCATGTTCTTCTCTGAAGGCGCAAAGGCTCAGGTGGCTTTTAAGCAGATCAAGGCTTTCATGCAGGCGCTGTATCCTAACGCCCAGACCGGGCACGGTCACCTTCTGATGCCACTACGGTGCGAGTGCAACTCAAAGCCTGGGCATGCACCCTTTTTGGGAAGGCAGCTACCAAAGTTGACTCCGTTCGCCCTGAGCAACGCGGAGGACCTGGACGCGGATCTGATCTCCGACAAGAGCGTGCTGGCCAGCGTGCACCACCCGGCGCTGATAGTGTTCCAGTGCTGCAACCCTGTGTATCGCAACTCGCGCGCGCAGGGCGGAGGCCCCAACTGCGACTTCAAGATATCGGCGCCCGACCTGCTAAACGCGTTGGTGATGGTGCGCAGCCTGTGGAGTGAAAACTTCACCGAGCTGCCGCGGATGGTTGTGCCTGAGTTTAAGTGGAGCACTAAACACCAGTATCGCAACGTGTCCCTGCCAGTGGCGCATAGCGATGCGCGGCAGAACCCCTTTGATTTTTAA
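
An open reading frame such as the one listed above can be sanity-checked before use. The following is a minimal sketch, not part of the original disclosure: the function names `clean` and `check_orf` are hypothetical, and the short `toy` string is an illustrative placeholder rather than any sequence from this listing. The sketch strips whitespace that line wrapping may have introduced, then reports whether the sequence is a multiple of three in length, starts with ATG, ends with a stop codon, and contains any in-frame internal stop codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. from line wrapping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def check_orf(seq: str) -> dict:
    """Report basic reading-frame properties of a nucleotide sequence."""
    s = clean(seq)
    # Complete codons read from the first base; a trailing partial codon is ignored.
    codons = [s[i:i + 3] for i in range(0, len(s) - len(s) % 3, 3)]
    return {
        "length": len(s),
        "in_frame": len(s) % 3 == 0,
        "starts_with_atg": s.startswith("ATG"),
        "ends_with_stop": s[-3:] in STOP_CODONS,
        # Stop codons among all complete codons except the final one.
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
    }


# Illustrative placeholder only; paste a full listed ORF here to check it.
toy = "ATGGCCAGT CGGTTTGATTTT TAA"
print(check_orf(toy))
```

Because the check tolerates embedded spaces and line breaks, a sequence can be pasted directly from the wrapped text of the listing.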

E4 Full Sequence (SEQ ID NO:36)

CCCGGGCGTTTTAGGGCGGAGTAACTTGCATGTATTGGGAATTGTAGTTTTTTTAAAATGGGAAGTGACGTATCGTGGGAAAACGGAAGTGAAGATTTGAGGAAGTTGTGGGTTTTTTGGCTTTCGTTTCTGGGCGTAGGTTCGCGTGCGGTTTTCTGGGTGTTTTTTGTGGACTTTAACCGTTACGTCATTTTTTAGTCCTATATATACTCGCTCTGTACTTGGCCCTTTTTACACTGTGACTGATTGAGCTGGTGCCGTGTCGAGTGGTGTTTTTTAATAGGTTTTTTTACTGGTAAGGCTGACTGTTATGGCTGCCGCTGTGGAAGCGCTGTATGTTGTTCTGGAGCGGGAGGGTGCTATTTTGCCTAGGCAGGAGGGTTTTTCAGGTGTTTATGTGTTTTTCTCTCCTATTAATTTTGTTATACCTCCTATGGGGGCTGTAATGTTGTCTCTACGCCTGCGGGTATGTATTCCCCCGGGCTATTTCGGTCGCTTTTTAGCACTGACCGATGTTAACCAACCTGATGTGTTTACCGAGTCTTACATTATGACTCCGGACATGACCGAGGAACTGTCGGTGGTGCTTTTTAATCACGGTGACCAGTTTTTTTACGGTCACGCCGGCATGGCCGTAGTCCGTCTTATGCTTATAAGGGTTGTTTTTCCTGTTGTAAGACAGGCTTCTAATGTTTAAATGTTTTTTTTTTTGTTATTTTATTTTGTGTTTAATGCAGGAACCCGCAGACATGTTTGAGAGAAAAATGGTGTCTTTTTCTGTGGTGGTTCCGGAACTTACCTGCCTTTATCTGCATGAGCATGACTACGATGTGCTTGCTTTTTTGCGCGAGGCTTTGCCTGATTTTTTGAGCAGCACCTTGCATTTTATATCGCCGCCCATGCAACAAGCTTACATAGGGGCTACGCTGGTTAGCATAGCTCCGAGTATGCGTGTCATAATCAGTGTGGGTTCTTTTGTCATGGTTCCTGGCGGGGAAGTGGCCGCGCTGGTCCGTGCAGACCTGCACGATTATGTTCAGCTGGCCCTGCGAAGGGACCTACGGGATCGCGGTATTTTTGTTAATGTTCCGCTTTTGAATCTTATACAGGTCTGTGAGGAACCTGAATTTTTGCAATCATGATTCGCTGCTTGAGGCTGAAGGTGGAGGGCGCTCTGGAGCAGATTTTTACAATGGCCGGACTTAATATTCGGGATTTGCTTAGAGACATATTGATAAGGTGGCGAGATGAAAATTATTTGGGCATGGTTGAAGGTGCTGGAATGTTTATAGAGGAGATTCACCCTGAAGGGTTTAGCCTTTACGTCCACTTGGACGTGAGGGCAGTTTGCCTTTTGGAAGCCATTGTGCAACATCTTACAAATGCCATTATCTGTTCTTTGGCTGTAGAGTTTGACCACGCCACCGGAGGGGAGCGCGTTCACTTAATAGATCTTCATTTTGAGGTTTTGGATAATCTTTTGGAATAAAAAAAAAAAAACATGGTTCTTCCAGCTCTTCCCGCTCCTCCCGTGTGTGACTCGCAGAACGAATGTGTAGGTTGGCTGGGTGTGGCTTATTCTGCGGTGGTGGATGTTATCAGGGCAGCGGCGCATGAAGGAGTTTACATAGAACCCGAAGCCAGGGGGCGCCTGGATGCTTTGAGAGAGTGGATATACTACAACTACTACACAGAGCGAGCTAAGCGACGAGACCGGAGACGCAGATCTGTTTGTCACGCCCGCACCTGGTTTTGCTTCAGGAAATATGACTACGTCCGGCGTTCCATTTGGCATGACACTACGACCAACACGATCTCGGTTGTCTCGGCGCACTCCGTACAGTAGGGATCGCCTACCTCCTTTTGAGACAGAGACCCGCGCTACCATACTGGAGGATCATCCGCTGCTGCCCGAATGTAACACTTTGACAATGCACAACGTGAGTTACGTGCGAGGTCTTCCCTGCAGTGTGGGATTTACGCTGATTCAGGAATGGGTT
GTTCCCTGGGATATGGTTCTGACGCGGGAGGAGCTTGTAATCCTGAGGAAGTGTATGCACGTGTGCCTGTGTTGTGCCAACATTGATATCATGACGAGCATGATGATCCATGGTTACGAGTCCTGGGCTCTCCACTGTCATTGTTCCAGTCCCGGTTCCCTGCAGTGCATAGCCGGCGGGCAGGTTTTGGCCAGCTGGTTTAGGATGGTGGTGGATGGCGCCATGTTTAATCAGAGGTTTATATGGTACCGGGAGGTGGTGAATTACAACATGCCAAAAGAGGTAATGTTTATGTCCAGCGTGTTTATGAGGGGTCGCCACTTAATCTACCTGCGCTTGTGGTATGATGGCCACGTGGGTTCTGTGGTCCCCGCCATGAGCTTTGGATACAGCGCCTTGCACTGTGGGATTTTGAACAATATTGTGGTGCTGTGCTGCAGTTACTGTGCTGATTTAAGTGAGATCAGGGTGCGCTGCTGTGCCCGGAGGACAAGGCGTCTCATGCTGCGGGCGGTGCGAATCATCGCTGAGGAGACCACTGCCATGTTGTATTCCTGCAGGACGGAGCGGCGGCGGCAGCAGTTTATTCGCGCGCTGCTGCAGCACCACCGCCCTATCCTGATGCACGATTATGACTCTACCCCCATGTAGGCGTGGACTTCCCCTTCGCCGCCCGTTGAGCAACCGCAAGTTGGACAGCAGCCTGTGGCTCAGCAGCTGGACAGCGACATGAACTTAAGCGAGCTGCCCGGGGAGTTTATTAATATCACTGATGAGCGTTTGGCTCGACAGGAAACCGTGTGGAATATAACACCTAAGAATATGTCTGTTACCCATGATATGATGCTTTTTAAGGCCAGCCGGGGAGAAAGGACTGTGTACTCTGTGTGTTGGGAGGGAGGTGGCAGGTTGAATACTAGGGTTCTGTGAGTTTGATTAAGGTACGGTGATCAATATAAGCTATGTGGTGGTGGGGCTATACTACTGAATGAAAAATGACTTGAAATTTTCTGCAATTGAAAAATAAACACGTTGAAACATAACATGCAACAGGTTCACGATTCTTTATTCCTGGGCAATGTAGGAGAAGGTGTAAGAGTTGGTAGCAAAAGTTTCAGTGGTGTATTTTCCACTTTCCCAGGACCATGTAAAAGACATAGAGTAAGTGCTTACCTCGCTAGTTTCTGTGGATTCACTAGA A

E4 Orf6 Sequence (SEQ ID NO:37)

ATGACTACGTCCGGCGTTCCATTTGGCATGACACTACGACCAACACGATCTCGGTTGTCTCGGCGCACTCCGTACAGTAGGGATCGCCTACCTCCTTTTGAGACAGAGACCCGCGCTACCATACTGGAGGATCATCCGCTGCTGCCCGAATGTAACACTTTGACAATGCACAACGTGAGTTACGTGCGAGGTCTTCCCTGCAGTGTGGGATTTACGCTGATTCAGGAATGGGTTGTTCCCTGGGATATGGTTCTGACGCGGGAGGAGCTTGTAATCCTGAGGAAGTGTATGCACGTGTGCCTGTGTTGTGCCAACATTGATATCATGACGAGCATGATGATCCATGGTTACGAGTCCTGGGCTCTCCACTGTCATTGTTCCAGTCCCGGTTCCCTGCAGTGCATAGCCGGCGGGCAGGTTTTGGCCAGCTGGTTTAGGATGGTGGTGGATGGCGCCATGTTTAATCAGAGGTTTATATGGTACCGGGAGGTGGTGAATTACAACATGCCAAAAGAGGTAATGTTTATGTCCAGCGTGTTTATGAGGGGTCGCCACTTAATCTACCTGCGCTTGTGGTATGATGGCCACGTGGGTTCTGTGGTCCCCGCCATGAGCTTTGGATACAGCGCCTTGCACTGTGGGATTTTGAACAATATTGTGGTGCTGTGCTGCAGTTACTGTGCTGATTTAAGTGAGATCAGGGTGCGCTGCTGTGCCCGGAGGACAAGGCGTCTCATGCTGCGGGCGGTGCGAATCATCGCTGAGGAGACCACTGCCATGTTGTATTCCTGCAGGACGGAGCGGCGGCGGCAGCAGTTTATTCGCGCGCTGCTGCAGCACCACCGCCCTATCCTGATGCACGATTATGACTCTACCCCCATGTAG

VA Sequence (VA transcripts I and II are underlined) (SEQ ID NO:38)

CGTAATCCGTAGATGTACCTGGACATCCAGGTGATGCCGGCGGCGGTGGTGGAGGCGCGCGGAAAGTCGCGGACGCGGTTCCAGATGTTGCGCAGCGGCAAAAAGTGCTCCATGGTCGGGACGCTCTGGCCGGTGAGGCGTGCGCAGTCGTTGACGCTCTAGACCGTGCAAAAGGAGAGCCTGTAAGCGGGCACTCTTCCGTGGTCTGGTGGATAAATTCGCAAGGGTATCATGGCGGACGACCGGGGTTCGAACCCCGGATCCGGCCGTCCGCCGTGATCCATGCGGTTACCGCCCGCGTGTCGAACCCAGGTGTGCGACGTCAGACAACGGGGGAGCGCTCCTTTTGGCTTCCTTCCAGGCGCGGCGGCTGCTGCGCTAGCTTTTTTGGCCACTGGCCGCGCGCGGCGTAAGCGGTTAGGCTGGAAAGCGAAAGCATTAAGTGGCTCGCTCCCTGTAGCCGGAGGGTTATTTTCCAAGGGTTGAGTCGCAGGACCCCCGGTTCGAGTCTCGGGCCGGCCGGACTGCGGCGAACGGGGGTTTGCCTCCCCGTCATGCAAGACCCCGCTTGCAAATTCCTCCGGAAACAGGGACGAGCCCCTTTTTTGCTTTTCCCAGATGCATCCGGTGCTGCGGCAGATGCGCCCCCCTCCTCAGCAGCGGCAAGAGCAAGAGCAGCGGCAGACATGCAGGGCACCCTCCCCTTCTCCTACCGCGTCAGGAGGGGCAACATCCTACATCGA

Sequences for E1A and E1B are both contained within Accession AY339865.1

Ad5 E1A

Two proteins can be expressed, a 32 kDa protein (first accession number) and a 27 kDa protein (second accession number). Both are splice variants of the same transcript:

-   Accession 1: AAQ19284.1
-   Accession 2: AAQ19285.1

(SEQ ID NO:39)

ATGAGACATATTATCTGCCACGGAGGTGTTATTACCGAAGAAATGGCCGCCAGTCTTTTGGACCAGCTGATCGAAGAGGTACTGGCTGATAATCTTCCACCTCCTAGCCATTTTGAACCACCTACCCTTCACGAACTGTATGATTTAGACGTGACGGCCCCCGAAGATCCCAACGAGGAGGCGGTTTCGCAGATTTTTCCCGACTCTGTAATGTTGGCGGTGCAGGAAGGGATTGACTTACTCACTTTTCCGCCGGCGCCCGGTTCTCCGGAGCCGCCTCACCTTTCCCGGCAGCCCGAGCAGCCGGAGCAGAGAGCCTTGGGTCCGGTTTCTATGCCAAACCTTGTACCGGAGGTGATCGATCTTACCTGCCACGAGGCTGGCTTTCCACCCAGTGACGACGAGGATGAAGAGGGTGAGGAGTTTGTGTTAGATTATGTGGAGCACCCCGGGCACGGTTGCAGGTCTTGTCATTATCACCGGAGGAATACGGGGGACCCAGATATTATGTGTTCGCTTTGCTATATGAGGACCTGTGGCATGTTTGTCTACAGTCCTGTGTCTGAACCTGAGCCTGAGCCCGAGCCAGAACCGGAGCCTGCAAGACCTACCCGCCGTCCTAAAATGGCGCCTGCTATCCTGAGACGCCCGACATCACCTGTGTCTAGAGAATGCAATAGTAGTACGGATAGCTGTGACTCCGGTCCTTCTAACACACCTCCTGAGATACACCCGGTGGTCCCGCTGTGCCCCATTAAACCAGTTGCCGTGAGAGTTGGTGGGCGTCGCCAGGCTGTGGAATGTATCGAGGACTTGCTTAACGAGCCTGGGCAACCTTTGGACTTGAGCT GTAAACGCCCCAGGCCATAA

(SEQ ID NO:40)

ATGAGACATATTATCTGCCACGGAGGTGTTATTACCGAAGAAATGGCCGCCAGTCTTTTGGACCAGCTGATCGAAGAGGTACTGGCTGATAATCTTCCACCTCCTAGCCATTTTGAACCACCTACCCTTCACGAACTGTATGATTTAGACGTGACGGCCCCCGAAGATCCCAACGAGGAGGCGGTTTCGCAGATTTTTCCCGACTCTGTAATGTTGGCGGTGCAGGAAGGGATTGACTTACTCACTTTTCCGCCGGCGCCCGGTTCTCCGGAGCCGCCTCACCTTTCCCGGCAGCCCGAGCAGCCGGAGCAGAGAGCCTTGGGTCCGGTTTCTATGCCAAACCTTGTACCGGAGGTGATCGATCTTACCTGCCACGAGGCTGGCTTTCCACCCAGTGACGACGAGGATGAAGAGGGTCCTGTGTCTGAACCTGAGCCTGAGCCCGAGCCAGAACCGGAGCCTGCAAGACCTACCCGCCGTCCTAAAATGGCGCCTGCTATCCTGAGACGCCCGACATCACCTGTGTCTAGAGAATGCAATAGTAGTACGGATAGCTGTGACTCCGGTCCTTCTAACACACCTCCTGAGATACACCCGGTGGTCCCGCTGTGCCCCATTAAACCAGTTGCCGTGAGAGTTGGTGGGCGTCGCCAGGCTGTGGAATGTATCGAGGACTTGCTTAACGAGCCTGGGCAACCTTTGGACTTGAGCTGTAAACGCCCCAGGCCATAA

Ad5 E1B_19K Accession: AAQ19286.1

(SEQ ID NO:41)

ATGGAGGCTTGGGAGTGTTTGGAAGATTTTTCTGCTGTGCGTAACTTGCTGGAACAGAGCTCTAACAGTACCTCTTGGTTTTGGAGGTTTCTGTGGGGCTCATCCCAGGCAAAGTTAGTCTGCAGAATTAAGGAGGATTACAAGTGGGAATTTGAAGAGCTTTTGAAATCCTGTGGTGAGCTGTTTGATTCTTTGAATCTGGGTCACCAGGCGCTTTTCCAAGAGAAGGTCATCAAGACTTTGGATTTTTCCACACCGGGGCGCGCTGCGGCTGCTGTTGCTTTTTTGAGTTTTATAAAGGATAAATGGAGCGAAGAAACCCATCTGAGCGGGGGGTACCTGCTGGATTTTCTGGCCATGCATCTGTGGAGAGCGGTTGTGAGACACAAGAATCGCCTGCTACTGTTGTCTTCCGTCCGCCCGGCGATAATACCGACGGAGGAGCAGCAGCAGCAGCAGGAGGAAGCCAGGCGGCGGCGGCAGGAGCAGAGCCCATGGAACCCGAGAGCCGGCCTGGACCCTCGGGAATGA

Ad5 E1B_55K Accession: AAQ19287.1

(SEQ ID NO:42)

ATGGAGCGAAGAAACCCATCTGAGCGGGGGGTACCTGCTGGATTTTCTGGCCATGCATCTGTGGAGAGCGGTTGTGAGACACAAGAATCGCCTGCTACTGTTGTCTTCCGTCCGCCCGGCGATAATACCGACGGAGGAGCAGCAGCAGCAGCAGGAGGAAGCCAGGCGGCGGCGGCAGGAGCAGAGCCCATGGAACCCGAGAGCCGGCCTGGACCCTCGGGAATGAATGTTGTACAGGTGGCTGAACTGTATCCAGAACTGAGACGCATTTTGACAATTACAGAGGATGGGCAGGGGCTAAAGGGGGTAAAGAGGGAGCGGGGGGCTTGTGAGGCTACAGAGGAGGCTAGGAATCTAGCTTTTAGCTTAATGACCAGACACCGTCCTGAGTGTATTACTTTTCAACAGATCAAGGATAATTGCGCTAATGAGCTTGATCTGCTGGCGCAGAAGTATTCCATAGAGCAGCTGACCACTTACTGGCTGCAGCCAGGGGATGATTTTGAGGAGGCTATTAGGGTATATGCAAAGGTGGCACTTAGGCCAGATTGCAAGTACAAGATCAGCAAACTTGTAAATATCAGGAATTGTTGCTACATTTCTGGGAACGGGGCCGAGGTGGAGATAGATACGGAGGATAGGGTGGCCTTTAGATGTAGCATGATAAATATGTGGCCGGGGGTGCTTGGCATGGACGGGGTGGTTATTATGAATGTAAGGTTTACTGGCCCCAATTTTAGCGGTACGGTTTTCCTGGCCAATACCAACCTTATCCTACACGGTGTAAGCTTCTATGGGTTTAACAATACCTGTGTGGAAGCCTGGACCGATGTAAGGGTTCGGGGCTGTGCCTTTTACTGCTGCTGGAAGGGGGTGGTGTGTCGCCCCAAAAGCAGGGCTTCAATTAAGAAATGCCTCTTTGAAAGGTGTACCTTGGGTATCCTGTCTGAGGGTAACTCCAGGGTGCGCCACAATGTGGCCTCCGACTGTGGTTGCTTCATGCTAGTGAAAAGCGTGGCTGTGATTAAGCATAACATGGTATGTGGCAACTGCGAGGACAGGGCCTCTCAGATGCTGACCTGCTCGGACGGCAACTGTCACCTGCTGAAGACCATTCACGTAGCCAGCCACTCTCGCAAGGCCTGGCCAGTGTTTGAGCATAACATACTGACCCGCTGTTCCTTGCATTTGGGTAACAGGAGGGGGGTGTTCCTACCTTACCAATGCAATTTGAGTCACACTAAGATATTGCTTGAGCCCGAGAGCATGTCCAAGGTGAACCTGAACGGGGTGTTTGACATGACCATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACCCGCACCAGGTGCAGACCCTGCGAGTGTGGCGGTAAACATATTAGGAACCAGCCTGTGATGCTGGATGTGACCGAGGAGCTGAGGCCCGATCACTTGGTGCTGGCCTGCACCCGCGCTGAGTTTGGCTCTAGCGATGAAGATACAGATTGA

Sequences for E2A and E4A are both contained within Accession MN088492

Ad5 E2A Orf Accession: QHX41645.1

(SEQ ID NO:43)

ATGGCCAGTCGGGAAGAGGAGCAGCGCGAAACCACCCCCGAGCGCGGACGCGGTGCGGCGCGACGTCCACCAACCATGGAGGACGTGTCGTCCCCGTCGCCGTCGCCGCCGCCTCCCCGCGCGCCCCCAAAAAAGCGGCTGAGGCGGCGTCTCGAGTCCGAGGACGAAGAAGACTCGTCACAAGATGCGCTGGTGCCGCGCACACCCAGCCCGCGGCCATCGACCTCGACGGCGGATTTGGCCATTGCGTCCAAAAAGAAAAAGAAGCGCCCCTCTCCCAAGCCCGAGCGCCCGCCATCCCCAGAGGTGATCGTGGACAGCGAGGAAGAAAGAGAAGATGTGGCGCTACAAATGGTGGGTTTCAGCAACCCACCGGTGCTAATCAAGCACGGCAAGGGAGGTAAGCGCACGGTGCGGCGGCTGAATGAAGACGACCCAGTGGCGCGGGGTATGCGGACGCAAGAGGAAAAGGAAGAGTCCAGTGAAGCGGAAAGTGAAAGCACGGTGATAAACCCGCTGAGCCTGCCGATCGTGTCTGCGTGGGAGAAGGGCATGGAGGCTGCGCGCGCGTTGATGGACAAGTACCACGTGGATAACGATCTAAAGGCAAACTTCAAGCTACTGCCTGACCAAGTGGAAGCTCTGGCGGCCGTATGCAAGACCTGGCTAAACGAGGAGCACCGCGGGTTGCAGCTGACCTTCACCAGCAACAAGACCTTTGTGACGATGATGGGGCGATTCCTGCAGGCGTACCTGCAGTCGTTTGCAGAGGTAACCTACAAGCACCACGAGCCCACGGGCTGCGCGTTGTGGCTGCACCGCTGCGCTGAGATCGAAGGCGAGCTTAAGTGTCTACACGGGAGCATTATGATAAATAAGGAGCACGTGATTGAAATGGATGTGACGAGCGAAAACGGGCAGCGCGCGCTGAAGGAGCAGTCTAGCAAGGCCAAGATCGTGAAGAACCGGTGGGGCCGAAATGTGGTGCAGATCTCCAACACCGACGCAAGGTGCTGCGTGCATGACGCGGCCTGTCCGGCCAATCAGTTTTCCGGCAAGTCTTGCGGCATGTTCTTCTCTGAAGGCGCAAAGGCTCAGGTGGCTTTTAAGCAGATCAAGGCTTTCATGCAGGCGCTGTATCCTAACGCCCAGACCGGGCACGGTCACCTTCTGATGCCACTACGGTGCGAGTGCAACTCAAAGCCTGGGCATGCACCCTTTTTGGGAAGGCAGCTACCAAAGTTGACTCCGTTCGCCCTGAGCAACGCGGAGGACCTGGACGCGGATCTGATCTCCGACAAGAGCGTGCTGGCCAGCGTGCACCACCCGGCGCTGATAGTGTTCCAGTGCTGCAACCCTGTGTATCGCAACTCGCGCGCGCAGGGCGGAGGCCCCAACTGCGACTTCAAGATATCGGCGCCCGACCTGCTAAACGCGTTGGTGATGGTGCGCAGCCTGTGGAGTGAAAACTTCACCGAGCTGCCGCGGATGGTTGTGCCTGAGTTTAAGTGGAGCACTAAACACCAGTATCGCAACGTGTCCCTGCCAGTGGCGCATAGCGATGCGCGGCAGAACCCCTTTGATTTTTAA

Ad5 E4A

Two proteins are encoded within this ORF. The first is encoded by a splice variant contained within the ORF. The second is encoded by a non-spliced transcript present in the ORF. Accession 1: QHX41659.1; Accession 2: QHX41660.1

(SEQ ID NO:44)

ATGACTACGTCCGGCGTTCCATTTGGCATGACACTACGACCAACACGATCTCGGTTGTCTCGGCGCACTCCGTACAGTAGGGATCGCCTACCTCCTTTTGAGACAGAGACCCGCGCTACCATACTGGAGGATCATCCGCTGCTGCCCGAATGTAACACTTTGACAATGCACAACGCGTGGACTTCCCCTTCGCCGCCCGTTGAGCAACCGCAAGTTGGACAGCAGCCTGTGGCTCAGCAGCTGGACAGCGACATGAACTTAAGCGAGCTGCCCGGGGAGTTTATTAATATCACTGATGAGCGTTTGGCTCGACAGGAAACCGTGTGGAATATAACACCTAAGAATATGTCTGTTACCCATGATATGATGCTTTTTAAGGCCAGCCGGGGAGAAAGGACTGTGTACTCTGTGTGTTGGGAGGGAGGTGGCAGGTTGAATACTAGGGTTCTG TGA

(SEQ ID NO:45)

ATGACTACGTCCGGCGTTCCATTTGGCATGACACTACGACCAACACGATCTCGGTTGTCTCGGCGCACTCCGTACAGTAGGGATCGCCTACCTCCTTTTGAGACAGAGACCCGCGCTACCATACTGGAGGATCATCCGCTGCTGCCCGAATGTAACACTTTGACAATGCACAACGTGAGTTACGTGCGAGGTCTTCCCTGCAGTGTGGGATTTACGCTGATTCAGGAATGGGTTGTTCCCTGGGATATGGTTCTGACGCGGGAGGAGCTTGTAATCCTGAGGAAGTGTATGCACGTGTGCCTGTGTTGTGCCAACATTGATATCATGACGAGCATGATGATCCATGGTTACGAGTCCTGGGCTCTCCACTGTCATTGTTCCAGTCCCGGTTCCCTGCAGTGCATAGCCGGCGGGCAGGTTTTGGCCAGCTGGTTTAGGATGGTGGTGGATGGCGCCATGTTTAATCAGAGGTTTATATGGTACCGGGAGGTGGTGAATTACAACATGCCAAAAGAGGTAATGTTTATGTCCAGCGTGTTTATGAGGGGTCGCCACTTAATCTACCTGCGCTTGTGGTATGATGGCCACGTGGGTTCTGTGGTCCCCGCCATGAGCTTTGGATACAGCGCCTTGCACTGTGGGATTTTGAACAATATTGTGGTGCTGTGCTGCAGTTACTGTGCTGATTTAAGTGAGATCAGGGTGCGCTGCTGTGCCCGGAGGACAAGGCGTCTCATGCTGCGGGCGGTGCGAATCATCGCTGAGGAGACCACTGCCATGTTGTATTCCTGCAGGACGGAGCGGCGGCGGCAGCAGTTTATTCGCGCGCTGCTGCAGCACCACCGCCCTATCCTGATGCACGATTATGACTCTACCCCCATGTAG
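The two E4A coding sequences above begin with the same 5' region and then diverge. A minimal sketch of how such sequences might be sanity-checked is given below. It is illustrative only and is not part of the disclosed methods; the function names (`clean`, `is_complete_orf`, `shared_prefix_length`) and the short test strings are hypothetical and are not taken from the Sequence Listing. The sketch assumes a plain DNA alphabet (A, C, G, T) and discards any other characters, such as whitespace or position counters.

```python
# Illustrative helpers (hypothetical names) for checking coding sequences.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Upper-case the input and keep only A, C, G, T (drops spaces, digits, etc.)."""
    return "".join(ch for ch in seq.upper() if ch in "ACGT")


def is_complete_orf(seq: str) -> bool:
    """Return True if seq starts with ATG, has a length divisible by 3,
    ends with a stop codon, and contains no earlier in-frame stop codon."""
    s = clean(seq)
    if len(s) < 6 or len(s) % 3 != 0 or not s.startswith("ATG"):
        return False
    codons = [s[i:i + 3] for i in range(0, len(s), 3)]
    if codons[-1] not in STOP_CODONS:
        return False
    return not any(c in STOP_CODONS for c in codons[:-1])


def shared_prefix_length(a: str, b: str) -> int:
    """Return the number of leading bases that two sequences have in common."""
    n = 0
    for x, y in zip(clean(a), clean(b)):
        if x != y:
            break
        n += 1
    return n


if __name__ == "__main__":
    # Toy inputs only; real sequences would be pasted in from the listing.
    print(is_complete_orf("ATG GCC TGA"))
    print(shared_prefix_length("ATGACTACGT", "ATGACTGGGT"))
```

Applied to two full-length sequences, `shared_prefix_length` would report where a spliced and a non-spliced coding sequence stop being identical, and `is_complete_orf` would flag a sequence that was truncated or shifted out of frame during transcription of the listing.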

Ad5 VA Accession: AF369965.1

(SEQ ID NO:46)

TCGATGTAGGATGTTGCCCCTCCTGACGCGGTAGGAGAAGGGGAGGGTGCCCTGCATGTCTGCCGCTGCTCTTGCTCTTGCCGCTGCTGAGGAGGGGGGCGCATCTGCCGCAGCACCGGATGCATCTGGGAAAAGCAAAAAAGGGGCTCGTCCCTGTTTCCGGAGGAATTTGCAAGCGGGGTCTTGCATGACGGGGAGGCAAACCCCCGTTCGCCGCAGTCCGGCCGGCCCGAGACTCGAACCGGGGGTCCTGCGACTCAACCCTTGGAAAATAACCCTCCGGCTACAGGGAGCGAGCCACTTAATGCTTTCGCTTTCCAGCCTAACCGCTTACGCCGCGCGCGGCCAGTGGCCAAAAAAGCTAGCGCAGCAGCCGCCGCGCCTGGAAGGAAGCCAAAAGGAGCGCTCCCCCGTTGTCTGACGTCGCACACCTGGGTTCGACACGCGGGCGGTAACCGCATGGATCACGGCGGACGGCCGGATCCGGGGTTCGAACCCCGGTCGTCCGCCATGATACCCTTGCGAATTTATCCACCAGACCACGGAAGAGTGCCCGCTTACAGGCTCTCCTTTTGCACGGTCTAGAGCGTCAACGACTGCGCACGCCTCACCGGCCAGAGCGTCCCGACCATGGAGCACTTTTTGCCGCTGCGCAACATCTGGAACCGCGTCCGCGACTTTCCGCGCGCCTCCACCACCGCCGCCGGCATCACCTGGATGTCCAGGTACATCTACGGATTACG

Example 13 - Promoter, Operator, IRES and Intron Sequences

CMV Promoter (SEQ ID NO:47)

TAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTCGACGTTTAGTGAACCG

2xTet Operator Sequence (SEQ ID NO: 48)

TCCCTATCAGTGATAGAGATCTCCCTATCAGTGATAGAGA

hCMV Intron Sequence (SEQ ID NO: 49)

GTAAGTACCGCCTATAGAGTCTATAGGCCCACCCCCTTGGCTTCTTATGCATGCTATACTGTTTTTGGCTTGGGGTCTATACACCCCCGCTTCCTCATGTTATAGGTGATGGTATAGCTTAGCCTATAGGTGTGGGTTATTGACCATTATTGACCACTCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACATGGCTCTTTGCCACAACTCTCTTTATTGGCTATATGCCAATACACTGTCCTTCAGAGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCTCATTTATTATTTACAAATTCACATATACAACACCACCGTCCCCAGTGCCCGCAGTTTTTATTAAACATAACGTGGGATCTCCACGCGAATCTCGGGTACGTGTTCCGGACATGGTCTCTTCTCCGGTAGCGGCGGAGCTTCTACATCCGAGCCCTGCTCCCATGCCTCCAGCGACTCATGGTCGCTCGGCAGCTCCTTGCTCCTAACAGTGGAGGCCAGACTTAGGCACAGCACGATGCCCACCACCACCAGTGTGCCGCACAAGGCCGTGGCGGTAGGGTATGTGTCTGAAAATGAGCTCGGGGAGCGGGCTTGCACCGCTGACGCATTTGGAAGACTTAAGGCAGCGGCAGAAGAAGATGCAGGCAGCTGAGTTGTTGTGTTCTGATAAGAGTCAGAGGTAACTCCCGTTGCGGTGCTGTTAACGGTGGAGGGCAGTGTAGTCTGAGCAGTACTCGTTGCTGCCGCGCGCGCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGGGTCTTTTCTGCAG

EMCV IRES Sequence (SEQ ID NO: 50)

CCCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATTGCTCGAATCACC

FMDV IRES (SEQ ID NO: 51)

AGCAGGTTTCCCCAACTGACACAAAACGTGCAACTTGAAACTCCGCCTGGTCTTTCCAGGTCTAGAGGGGTAACACTTTGTACTGTGTTTGGCTCCACGCTCGATCCACTGGCGAGTGTTAGTAACAGCACTGTTGCTTCGTAGCGGAGCATGACGGCCGTGGGAACTCCTCCTTGGTAACAAGGACCCACGGGGCCAAAAGCCACGCCCACACGGGCCCGTCATGTGTGCAACCCCAGCACGGCGACTTTACTGCGAAACCCACTTTAAAGTGACATTGAAACTGGTACCCACACACTGGTGACAGGCTAAGGATGCCCTTCAGGTACCCCGAGGTAACACGCGACACTCGGGATCTGAGAAGGGGACTGGGGCTTCTATAAAAGCGCTCGGTTTAAAAAGCTTCTATGCCTGAATAGGTGACCGGAGGTCGGCACCTTTCCTTTACAA TTAATGACCCT

Example 14 - CHO and Mouse Stable Site 1 Sequences - U.S. Pat. No. 7,771,997

-   <211> 6473
-   <212> DNA
-   <213> Cricetulus griseus
-   <400> 1

(SEQ ID NO:52)

tctagaaaca aaaccaaaaa tattaagtca ggcttggctt caggtgctgg ggtggagtgc 60tgacaaaaat acacaaattc ctggctttct aaggcttttt cggggattca ggtattgggt 120gatggtagaa taaaaatctg aaacataggt gatgtatctg ccatactgca tgggtgtgta 180tgtgtgtgta tgtgtgtctg tgtgtgtgcc cagacagaaa taccatgaag gaaaaaaaca 240cttcaaagac aggagagaag agtgacctgg gaaggactcc ccaatgagat gagaactgag 300cacatgccag aggaggtgag gactgaacca ttcaacacaa gtggtgaata gtcctgcaga 360cacagagagg gccagaagca ctcagaactc cagggggtca ggagtggttc tctggaggct 420tctgcccttg gaggttcctg aggaggaggc ttccatattg aaaatgtagt tagtggccgt 480ttccattagt acagtgacta gagagagctg agggaccact ggactgaggc ctagatgctc 540agtcagatgg ccatgaaagc ctagacaagc acttccgggt ggaaaggaaa cagcaggtgt 600gaggggtcag gggcaagtta gtgggagagg tcttccagat gaagtagcag gaacggagac 660gcactggatg gccccacttg tcaaccagca aaagcttgga tcttgttcta agaggccagg 720gacatgacaa gggtgatctc ggtttttaaa aggctttgtg ttacctaatc acttctatta 780gtcagatact ttgtaacaca aatgagtact tggcctgtat tttagaaact tctgggatcc 840tgaaaaaaca caatgacatt ctggctgcaa cacctggaga ctcccagcca ggccctggac 900ccgggtccat tcatgcaaat actcagggac agattcttca ctaggtactg atgagctgtc 960ttggatgcaa atgtggcctc ttcattttac tacaagtcac catgagtcag gaggtgctgt 1020ttgcacagtg tgactaagtg atggagtgtt gactgcagcc attcccggcc ccagcttgtg 1080agagagatcc ttttaaattg aaagtaagct caaagttacc acgaagccac acatgtataa 1140actgtgtgaa taatctgtgc acatacacaa accatgtgaa taatctgtgt acatgtataa 1200actgtgtgaa taatctgtgt gcagcctttc cttacctact accttccagt gatcaggttt 1260ggactgcctg tgtgctactg gaccctgaat gtccccaccg ctgtcccctg tcttttacga 1320ttctgacatt tttaataaat tcagcggctt cccctctgct ctgtgcctag ctataccttg 1380gtactctgca ttttggtttc tgtgacattt ctctgtgact ctgctacatt ctcagatgac 1440atgtgacaca gaaggtgttc cctctggaga catgtgatgt ccctgtcatt agtggaatca 1500gatgccccca aactgttgtc cagtgtttgg gaaagtgaca cgtgaaggag gatcaggaaa 1560agaggggtgg aaatcaagat gtgtctgagt atctcatgtc cctgagtggt ccaggctgct 1620gacttcactc ccccaagtga gggaggccat ggtgagtaca cacacctcac acatactata 1680tccaacacac acacacacac acacacacac acgcacgcac gcacgcacgc 
acgcacacat 1740gcacacacac gaactacatt tcacaaacca catacgcata ttacacccca aacgtatcac 1800ctatacatac cacacataca cacccctcca cacatcacac acataccaca cccacacaca 1860gcacacacat acataggcac acattcacac accacacata tacatttgtg tatgcataca 1920tgcatacaca cacaggcaca cagacaccac acacatgcat tgtgtacgca cacatgcata 1980cacacacata ggcacacatt gagcacacac atacatttgt gtacgcacac tacatagaca 2040tatatgcatt tgtatatgca cacatgcatg cacacataca taggcacaca tagagcacac 2100acatacattt gtgtatgcac acatgcacac accaatcaca tgggaagact caggttcttc 2160actaaggttc acatgaactt agcagttcct ggttatctcg tgaaacttgg aagattgctg 2220tggagaagag gaagcgttgg cttgagccct ggcagcaatt aaccccgccc agaagaagta 2280ggtttaaaaa tgagagggtc tcaatgtgga acccgcaggg cgccagttca gagaagagac 2340ctacccaagc caactgagag caaaggcaga gggatgaacc tgggatgtag tttgaacctc 2400tgtaccagct gggcttcatg ctattttgtt atatctttat taaatattct tttagtttta 2460tgtgcgtgaa taccttgctt gcataaatgt atgggcactg tatgtgttct tggtgccggt 2520ggaggccagg agagggcatg gatcctccgg agctggcgtt tgagacagtt gtgacccaca 2580gtgtggggtc tgggaactgg gtcttagtgt tccgcaagtg cagctggggc tcttaacctc 2640tgagccatcc ctccagcttc aagaaactta ttttcttagg acatggggga agggatccag 2700ggctttaggc ttgtttgttc agcaaatact cttttcgtgt attttgaatt ttattttatt 2760ttactttttt gggatagaat cacattctgc agctcaggct gggcctgaac tcatcaaaat 2820cctcctgtct cagtctacca ggtgataaga ttactgatgt gagcctggct ttgacaagca 2880ctttagagtc cccagccctt ctggacactt gttccaagta taatatatat atatatatat 2940atatatatat atatatatat atatattgtg tgtgtgtgtt tgtgtgtgta tgagacactt 3000gctctaaggg tatcatatat atccttgatt tgcttttaat ttatttttta attaaaaatg 3060attagctaca tgtcacctgt atgcgtctgt atcatctata tatccttcct tccttctctc 3120tctttctctc ttcttcttct cacccccaag catctatttt caaatccttg tgccgaggag 3180atgccaagag tctcgttggg ggagatggtg agggggcgat acaggggaag agcaggagga 3240aagggggaca gactggtgtg ggtctttgga gagctcagga gaatagcagc gatcttccct 3300gtccctggtg tcacctctta cagccaacac cattttgtgg cctggcagaa gagttgtcaa 3360gctggtcgca ggtctgccac acaaccccaa tctggcccca agaaaaggca cctgtgtgtg 3420actctggggt taaaggcgct 
gcctggtcgt ctccagctgg acttgaaact cccgtttaat 3480aaagagttct gcaaaataat acccgcagag tcacagtgcc aggttcccgt gctttcctga 3540agcgccaggc acgggttccc taggaaatgg ggccttgctt gccaagctcc cacggcttgc 3600cctgcaaacg gcctgaatga tctggcactc tgcgttgcca ctgggatgaa atggaaaaaa 3660gaaaaagaag aagtgtctct ggaagcgggc gcgctcacac aaacccgcaa cgattgtgta 3720aacactctcc attgagaatc tggagtgcgg ttgccctcta ctggggagct gaagacagct 3780agtgggggcg gggggaggac cgtgctagca tccttccacg gtgctcgctg gctgtggtgc 3840atgccgggaa ccgaaacgcg gaactaaagt caagtcttgc tttggtggaa ctgacaatca 3900acgaaatcac ttcgattgtt ttcctctttt tactggaatt cttggatttg atagatgggg 3960gaggatcaga gggggagggg aggggcgggg agacggaggg aggaggggag gaggggagga 4020ggggaggagg ggaggagggg aagggatgga ggaaaatact aacttttcta attcaacatg 4080acaaagattc ggagaaagtg caccgctagt gaccgggagg aggaatgccc tattgggcat 4140tatattccct gtcgtctaat ggaatcaaac tcttggttcc agcaccaagg attctgagcc 4200tatcctattc aagacagtaa ctacagccca cacggaagag gctatacaac tgaagaaata 4260aaattttcac tttatttcat ttctgtgact gcatgttcac atgtagagag ccacctgtgt 4320ctaggggctg atgtgctggg cagtagagtt ctgagcccgt taactggaac aacccagaac 4380tcccaccaca gttagagctt gctgagagag ggaggccctt ggtgagattt ctttgtgtat 4440ttatttagag acagggtctc atactgtagt ccaagctagc ctccagctca cagaaattct 4500cctgttccgg tttccaaagt actggagtta tgagtgtgtg ttaattgaac gctaagaatt 4560tgctgattga agaaaacctc aagtgggttt ggctaatccc cacgacccca gaggctgagg 4620caggaggaat gagagaattc aaggtttgcc agagccacag ggtgagctca atgtggagac 4680tgtgagggtg agctcaatgt ggagactgtg agggtgagct caatgtggag actgtgaggg 4740tgagctcaat gtggagactg tgagggtgag ctcaatgtgg agactgtgag ggtgagctca 4800atgtggagac ctgtatcaag ataataatag tagtagtaac aatgcaggcg agggtgtggt 4860tgagtggtag agcagttagt tgatttgaca tgcttgaggt ctcccggtcc atctgtggcc 4920ctgcaacagg aagggaggga ggaagggggg gaacgagaga gaggaaagag agacagaagc 4980taagataggg aatgagagag gaaggaagaa acgggaagaa attcagactc cttcctgagt 5040tccgccaacg cctagtgaca tcctgtgcac accctaaggt ggcctttgtg tggcactggc 5100ttgggtggtc gggaaaggca ttttcagctt gttgcagaac tgccacagta 
gcatgctggg 5160tccgtgaaag tttctgcccg ttaacaagaa gtctctacta cttgtgacct caccagtgaa 5220aatttcttta attgtctcct ggtgttctgg gttttgcatt tttgtttcta aggatacatt 5280cctgggtgat gtcatgaagt ccccaaagac acagtggggc tgtgttggat tgggaaagat 5340gatttatctg gggtgtcaaa aggaaaagaa gggaaacagg cacttgggaa aatgtcctcc 5400cgcccacccg aattttggct tggcaaccgt ggtggaggag caagaaacac gtggacgttt 5460gaggaggcat ggggtcctag gaggacagga agcagaagga gagagctggg ctgacagcct 5520gcaggcattg cacagtttca gaaggagatt acagcatgac tgagttttta gggatccaac 5580agggacctgg gtagagattc tgtgggctct gaggcaactt gacctcagcc agatggtatt 5640tgaataacct gctcttagag ggaaaacaga catagcaaac agagccacgt ttagtgatga 5700aactctcact ttgcctgagt catgtgcggc catgcccagg ggtcaggctg acactcaact 5760caaaaacaag tgagaaattg aagacaatcc gtggtggcag ctactggaag ggccaccaca 5820tccccagaaa gagtggagct gctaaaaagc catttgtgat aggcacagtt atcttgaatg 5880catggagcag agattacgga aaaatcgaga atgttaatga ggcaacattc gagttgagtc 5940attcagtgtg ggaaacccag acgcttccat cccctaaaag gaacatcttg ctctcagtca 6000aaatggaaat aaaaattggg gcttgaattt ggcaaatgat tcagaactct gtgtaggtat 6060tttcacacgc acagtggata attttcatgt tggagtttat ttgtgctaaa aggcagaaaa 6120gggtaaaaag cacatcttaa gagttatgag gttctacgaa taaaaataat gttacttaca 6180gctattcctt aattagtacc cccttccacc tgtggtaatt tcctgagata gtcagtgggg 6240aaaagatctc tccttctctt ctttctcccc ctcccctcct ctccctccct ccctccctcc 6300ctccctcctc tccctccctc cccctttcct tctttctttg ctccttctcc tctgcctcct 6360tctccctttc ttcttcattt attctaagta gcttttaaca gcacaccaat tacctgtgta 6420taacgggaaa acacaggctc aagcagctta gagaagattg atctgtgttc act        6473

-   <211> 7045
-   <212> DNA
-   <213> Cricetulus griseus
-   <400> 2

(SEQ ID NO: 53)

actagcgtgc aattcagagg tgggtgaaga taaaaggcaa acatttgagg ccatttcctt 60atttggcacg gcacttagga agtggaacat gcctaatcta ctggtttgta ccacctttcc 120ctataatgga ctgtttggga agctcctggg caaccgattc tggcatctca ttggtcagag 180gcctgttaaa tggtactctt atttgcaaag aaggctgtaa cttgtagctt taaaagcctc 240tcctcaagaa agaagggaga aaggatatgg ctagacatat ctaatagact taaccactgt 300gaaaagcctt agtatgaatc agatagaacc tatttttaac tcagttttga aaaaaataat 360ctttatattt atttgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 420gaaccacatg tagcaggtgc tggaggaggc cagaagaggg caccagatct cctggaactg 480acaccacaca tggttatgag ctgcctgatg tgggtgctgg gaactgaact ctcgtgttct 540gcaagagcag caactgttct cttaactgat gagccatctc tccagccccc cccataattt 600taattgttca ttttagtaaa ttttattcat aatcaattat cacagtataa aacaatgatt 660ttatatatat catatacata tcaaggatga cagtgagggg gatatgtgtg tgtgtgtgtg 720tgtgtgtgtg tgtgtgtgtg tgtgttattt gtgtgtgtgc tttttaagaa ggtgccatag 780tcactgcatt tctctgaagg atttcaaagg aatgagacat gtctgtctgc caggaaccct 840atcttcctct ttgggaatct gacccaaatg aggtattctg aggaactgaa tgaagagctc 900aagtagcagt gtcttaaacc caaatgtgct gtctagagaa agtcaacgtc atcagtgagc 960tgaggagaga tttactgagc ggaagacaag cgctctttga tttaagtggc tcgaacagtc 1020acggctgtgg agtggagcct gtgctcaggt ctgaggcagt ctttgctagc cagctgtgat 1080gagcagtgaa gaaagggtgg agatggaggc agggtgggag cagggctatg gttcagacta 1140ggtatcgtga gcacaccagc tggttgactt gtggtctgtg ggtcaggcgt tgtaaacgcc 1200ctcagggtca ggcagtcaca ttgcttgaag ctgaatgggt gaggcaacac agagagtgca 1260aagaaggcaa agtaccacct cttccccgac ccaggtcact tctgggttat agctgagact 1320ccggacagca tgcaaccagc tggttagagc ttcagggaaa acttgatgtc tgcatgttgc 1380tatgaaatgt gattcggtac atctggagaa aatttataat gctggctcag tcaagcactg 1440aacaaaggta ccttggcttt gggagctaca tgacattgac ttgtaggcag actttttttt 1500ttctgcccgc caattcccag ataaccaata tggaggctca atattaatta taaatgctcg 1560gctgatagct caggcttgtt actagctaac tcttccaact taaatgaacc catttctatt 1620atctacattc tgccacgtga ctttaccttg tacttcctgt ttcctctcct tgtctgactc 1680tgcccttctg cttcccagag tccttagtct ggttctcctg cctaacctta 
tcctgcccag 1740ctgctgacca agcatttata attaatatta agtctcccag tgagactctc atccagggag 1800gacttgggtg ctcccccctc ctcattgcca tccgtgtctt cctcttccct cgcttccccc 1860tcctcttcct gctcttcctc ctccacccct cctttcatag tattgatggc aagggtgttc 1920tagaatggag gagtgcccat aggcatgcaa agaaaccagt taggatgctc tgtgaggggt 1980tgtaatcata agcgatggac acaattcaag ccacagagtg aagacggaag gatgcactgt 2040gctctagagc aacttctggg gcagaatcac agggtgagtt tctgacttga gggcgaagag 2100gccacgagga agggagtgag tttgtctgag ctagaagcta cggcccacct cttggtagca 2160gacctgccca caagcatgct ttgttaatca tgtgggatct gattttcctc taaatctatg 2220ttcaactctt aagaaaatgt gaattctcac attaaaattt agatatacgt cttttggtgg 2280ggggggtgta aaaaatcctc aagaatatgg atttctgggg gccggagaga tggctcagag 2340gttaagagaa ctggttgctc ttctagacat tctgagttca attcccagca accacatggt 2400ggctcacaac catctgtaat gcgacctggt gccatcttct gacatgcatg gatacatgca 2460ggcagaaagc tgtatacata gtaaattgat aaatcttttt ttaaaaagag tatggattct 2520gccgggtgtt ggtggcgcac gcctttaatc ccagcactct ggaggcagag gcaggtggat 2580ctctgtgagt tcgagaccag cctggtctat aagagctagt tccaggacag cctccaaagc 2640cacagagaaa ccctgtctcg aaaaaccaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaga 2700gtatggattc taagaaagcc gtaacagctg gagctgtgta cggagttcag cgtggtacta 2760gaagaacaga cattcatgat gaaacacccc aggattttta cttagtatct agtttccatt 2820gttgttttga gaccggctct tatgctctcc aggctggcct caaactgctg atcttcccgc 2880ctctacctct caagtcctgg gactacttgg ctcataaaac agtttttgtc gggctccctg 2940aagttatggt tgtacaaacc gtgggggtca atatactcac ttgggcagag agagaaggtc 3000tgaatcccag acaatgactg catctcagga cagttgggaa gaggacaatg gcagaaggac 3060ttagaaaaga tagactggag ggtggaaaag cagcaggaac agagaaacaa aacaggaagc 3120ttgctatcca gggccactct ggagtcctgt ggcaagatgg aagcgggcta ggggaataca 3180tttgtgctac tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgat caatgcctat 3240caatgttgaa ggggaaatat gtataccaca ttgattctgg gagcaattct cagtatctgg 3300cctagagaaa ggaatggccc ctgcagaata gacagagtga atggtgccct ttatcatttg 3360ctaaagtgaa ggagaaataa acatccttcc atagagtttc aggtaaatga accccacagt 3420tcatctgtgc cgtggtggag 
gcctggccaa cagttaaaaa gattagacac ggacaaagtc 3480tgaaggaaac acctcgaata ggaagaggag agccacctca ttctgtaact ttcctcaagg 3540ggaagatgtt ccaagagtgg gaataaatgg tcaaaggggg gatttttaat taggaaaacg 3600atttcctgta tcacttgtga aactggaggt tgatttgggg cataggacaa tagatttgat 3660gctttgcaaa aagctgtttc aaagcagaga aatggaatag agacaattat gtagcgagga 3720gggagggtgg ggcgaagatg gagacagaga agtggaagct gactttaggg aagaggaaca 3780tagaccacag gggcggggcg gggggcaggg gcggggggcg gggctcaaag gaggcagtgg 3840gaacgttgct agtgttcgca gcgtaagcgt gaatgtgcaa gcgtctttgt ggtgtgtgac 3900caggagtagc gtggctggct tgtgtgctgc ttgtaatccc agtctttgag gtttccacac 3960tgttccacag tgggtgtgat tttccctcgg agagcatgag ggctctgctt tccccacatc 4020ctccccagcg ttcgttggta tttgtttcca agatgttagt gggtgagaca aagcctctct 4080gttgatttgc ctttaacagg tgacaaaaaa agctcaacca ggagacattt ttgccttctt 4140ggaaggtaat gctcccatgt agagcaatgg gacccatctc taaggtgagg ctactcttgc 4200agtttgcacc cagctcttct gatgcaggaa ggaagttggt gggcaagcaa gactgtttgc 4260ttcttgcgat ggacacattc tgcacacaaa ggctcaggag gggagaaggc tgtttgatgt 4320ttagcactca ggaaggcccc tgatgcatct gtgattagct gtctccatct gtggagcaga 4380cacggactaa ctaaaaacca gtgtttttaa attgtcaagc ctttaaggtg aggaaattga 4440cttattgtgc tgggccatac gtagagcaag tgctctgcat tgggccaacc cccggctctg 4500gtttctaggc accagaatgg cctagaacta actcacaatc ctcccattcc aggtctcagg 4560tgctagaatg aaccactata ccagcctgcc tgcctgccta cctgccttcc taaattttaa 4620atcatgggga gtaggggaga atacacttat cttagttagg gtttctattg ctgtgaagag 4680acaccatgag catggcaact cttataaagg aaaacattta gttgggtggc agtttcagag 4740gttttagtac attgtcatca tggctgggaa catgatggca tgcagacaga catggtgctg 4800gagaaaggga tgagagtcct acatcttgca ggcaacagga cctcagctga gacactggct 4860ggtaccctga gcataggaaa cctcacagcc caccctcaca gtgacatatt tccttcaaca 4920aagccatacc tcctaatagt gccactccct atgagatgac agggccaatt acattcaaac 4980tgctataaca ctttaaagta ttttattttt attattgtaa attatgtatg tagctgggtg 5040gtggcagccg aggtgcacgc ctttaatccc agcacttggg aggcagaggc agatggatct 5100ctgtgagttc aagaccagcc tggtctataa gagctagttg caaggaagga 
tatacaaaga 5160acagttctag gatagccttc aaagccacag agaagtgctg tcttgaaaac caaaaattgt 5220gctgggacct gtctctgctt tggttgcttc ccactccccc agagctggac tcttggtcaa 5280cactgaatca gctgcaaaat aaactcctgg attcctctct tgtaacagga gcccgaagtc 5340aggcgcccac ttgtcttctc gcaggattgc catagacttt ttctgtgtgc ccaccattcc 5400agactgaagt agagatggca gtggcagaga ctgggaaggc tgcaacgaaa acaggaagtt 5460attgcaccct gggaatagtc tggaaatgaa gcttcaaaac ttgcttcatg ttcagttgta 5520cacagactca ctcccaggtt gactcacacg tgtaaatatt cctgactatg tctgcactgc 5580ttttatctga tgcttccttc ccaaaatgcc aagtgtacaa ggtgagggaa tcacccttgg 5640attcagagcc cagggtcgtc ctccttaacc tggacttgtc tttctccggc agcctctgac 5700acccctcccc ccattttctc tatcagaagg tctgagcaga gttggggcac gctcatgtcc 5760tgatacactc cttgtcttcc tgaagatcta acttctgacc cagaaagatg gctaaggtgg 5820tgaagtgttt gacatgaaga cttggtctta agaactggag caggggaaaa aagtcggatg 5880tggcagcatg tacccgaaat cccagaactg gggaggtaga gacggatgag tgcccggggc 5940tagctggctg ctcagccagc ctagctgaat tgccaaattc caactcctat tgaaaaacct 6000ttaccaaaca aacaaacaaa caaataataa caacaacaac aacaacaaac taccccatac 6060aaggtgggcg gctcttggct cttgaggaat gactcaccca aacccaaagc ttgccacagc 6120tgttctctgg cctaaatggg gtgggggtgg ggcagagaca gagacagaga gagacatgac 6180ttcctgggct gggctgtgtg ctctaggcca ccaggaactt tcctgtcttg ctctctgtct 6240ggcacagcca gagcaccagc acccagcagg tgcacacacc tccctccgtg cttcttgagc 6300aaacacaggt gccttggtct gtctattgaa ccggagtaag ttcttgcaga tgtatgcatg 6360gaaacaacat tgtcctggtt ttatttctac tgttgtgata aaaaccgggg aactccagga 6420agcagctgag gcagaggcaa atgcaaggaa tgctgcctcc tagcttgctc cccatggctt 6480gccgggcctg ctttctgcaa gcccttctct ccccattggc atgcctgaca tgaacagcgt 6540ttgaaatgct ctcaaatgtc actttcaaag aaggcttctc tgatcttgct aactaaatca 6600gaccatgttt caccgtgcat tatctttctg ctgtctgtct gtctgtctgt ctgtctatct 6660gtctatcatc tatcaatcat ctatctatct atcttctatt tatctaccta tcattcaatc 6720atctatcttc taactagtta tcatttattt atttgtttac ttactttttt tatttgagac 6780agtatttctc tgagtgacag ccttggctgt cctggaaccc attctgtaac caggctgtcc 6840tcaaactcac agagatccaa 
ctgcctctgc ctctctggtg ctggggttaa agacgtgcac 6900caccaacgcc ccgctctatc atctatttat gtacttatta ttcagtcatt atctatcctc 6960taactatcca tcatctgtct atccatcatc tatctatcta tctatctatc tatctatcta 7020tctatcatcc atctataatc aattg                                       7045

-   <211> 6473
-   <212> DNA
-   <213> Cricetulus griseus
-   <400> 3

(SEQ ID NO: 54)

agtgaacaca gatcaatctt ctctaagctg cttgagcctg tgttttcccg ttatacacag 60gtaattggtg tgctgttaaa agctacttag aataaatgaa gaagaaaggg agaaggaggc 120agaggagaag gagcaaagaa agaaggaaag ggggagggag ggagaggagg gagggaggga 180gggagggagg gagaggaggg gagggggaga aagaagagaa ggagagatct tttccccact 240gactatctca ggaaattacc acaggtggaa gggggtacta attaaggaat agctgtaagt 300aacattattt ttattcgtag aacctcataa ctcttaagat gtgcttttta cccttttctg 360ccttttagca caaataaact ccaacatgaa aattatccac tgtgcgtgtg aaaataccta 420cacagagttc tgaatcattt gccaaattca agccccaatt tttatttcca ttttgactga 480gagcaagatg ttccttttag gggatggaag cgtctgggtt tcccacactg aatgactcaa 540ctcgaatgtt gcctcattaa cattctcgat ttttccgtaa tctctgctcc atgcattcaa 600gataactgtg cctatcacaa atggcttttt agcagctcca ctctttctgg ggatgtggtg 660gcccttccag tagctgccac cacggattgt cttcaatttc tcacttgttt ttgagttgag 720tgtcagcctg acccctgggc atggccgcac atgactcagg caaagtgaga gtttcatcac 780taaacgtggc tctgtttgct atgtctgttt tccctctaag agcaggttat tcaaatacca 840tctggctgag gtcaagttgc ctcagagccc acagaatctc tacccaggtc cctgttggat 900ccctaaaaac tcagtcatgc tgtaatctcc ttctgaaact gtgcaatgcc tgcaggctgt 960cagcccagct ctctccttct gcttcctgtc ctcctaggac cccatgcctc ctcaaacgtc 1020cacgtgtttc ttgctcctcc accacggttg ccaagccaaa attcgggtgg gcgggaggac 1080attttcccaa gtgcctgttt cccttctttt ccttttgaca ccccagataa atcatctttc 1140ccaatccaac acagccccac tgtgtctttg gggacttcat gacatcaccc aggaatgtat 1200ccttagaaac aaaaatgcaa aacccagaac accaggagac aattaaagaa attttcactg 1260gtgaggtcac aagtagtaga gacttcttgt taacgggcag aaactttcac ggacccagca 1320tgctactgtg gcagttctgc aacaagctga aaatgccttt cccgaccacc caagccagtg 1380ccacacaaag gccaccttag ggtgtgcaca ggatgtcact aggcgttggc ggaactcagg 1440aaggagtctg aatttcttcc cgtttcttcc ttcctctctc attccctatc ttagcttctg 1500tctctctttc ctctctctcg ttccccccct tcctccctcc cttcctgttg cagggccaca 1560gatggaccgg gagacctcaa gcatgtcaaa tcaactaact gctctaccac tcaaccacac 1620cctcgcctgc attgttacta ctactattat tatcttgata caggtctcca cattgagctc 1680accctcacag tctccacatt gagctcaccc tcacagtctc cacattgagc 
tcaccctcac 1740agtctccaca ttgagctcac cctcacagtc tccacattga gctcaccctc acagtctcca 1800cattgagctc accctgtggc tctggcaaac cttgaattct ctcattcctc ctgcctcagc 1860ctctggggtc gtggggatta gccaaaccca cttgaggttt tcttcaatca gcaaattctt 1920agcgttcaat taacacacac tcataactcc agtactttgg aaaccggaac aggagaattt 1980ctgtgagctg gaggctagct tggactacag tatgagaccc tgtctctaaa taaatacaca 2040aagaaatctc accaagggcc tccctctctc agcaagctct aactgtggtg ggagttctgg 2100gttgttccag ttaacgggct cagaactcta ctgcccagca catcagcccc tagacacagg 2160tggctctcta catgtgaaca tgcagtcaca gaaatgaaat aaagtgaaaa ttttatttct 2220tcagttgtat agcctcttcc gtgtgggctg tagttactgt cttgaatagg ataggctcag 2280aatccttggt gctggaacca agagtttgat tccattagac gacagggaat ataatgccca 2340atagggcatt cctcctcccg gtcactagcg gtgcactttc tccgaatctt tgtcatgttg 2400aattagaaaa gttagtattt tcctccatcc cttcccctcc tcccctcctc ccctcctccc 2460ctcctcccct cctccctccg tctccccgcc cctcccctcc ccctctgatc ctcccccatc 2520tatcaaatcc aagaattcca gtaaaaagag gaaaacaatc gaagtgattt cgttgattgt 2580cagttccacc aaagcaagac ttgactttag ttccgcgttt cggttcccgg catgcaccac 2640agccagcgag caccgtggaa ggatgctagc acggtcctcc ccccgccccc actagctgtc 2700ttcagctccc cagtagaggg caaccgcact ccagattctc aatggagagt gtttacacaa 2760tcgttgcggg tttgtgtgag cgcgcccgct tccagagaca cttcttcttt ttcttttttc 2820catttcatcc cagtggcaac gcagagtgcc agatcattca ggccgtttgc agggcaagcc 2880gtgggagctt ggcaagcaag gccccatttc ctagggaacc cgtgcctggc gcttcaggaa 2940agcacgggaa cctggcactg tgactctgcg ggtattattt tgcagaactc tttattaaac 3000gggagtttca agtccagctg gagacgacca ggcagcgcct ttaaccccag agtcacacac 3060aggtgccttt tcttggggcc agattggggt tgtgtggcag acctgcgacc agcttgacaa 3120ctcttctgcc aggccacaaa atggtgttgg ctgtaagagg tgacaccagg gacagggaag 3180atcgctgcta ttctcctgag ctctccaaag acccacacca gtctgtcccc ctttcctcct 3240gctcttcccc tgtatcgccc cctcaccatc tcccccaacg agactcttgg catctcctcg 3300gcacaaggat ttgaaaatag atgcttgggg gtgagaagaa gaagagagaa agagagagaa 3360ggaaggaagg atatatagat gatacagacg catacaggtg acatgtagct aatcattttt 3420aattaaaaaa taaattaaaa 
gcaaatcaag gatatatatg atacccttag agcaagtgtc 3480tcatacacac acaaacacac acacacaata tatatatata tatatatata tatatatata 3540tatatatata ttatacttgg aacaagtgtc cagaagggct ggggactcta aagtgcttgt 3600caaagccagg ctcacatcag taatcttatc acctggtaga ctgagacagg aggattttga 3660tgagttcagg cccagcctga gctgcagaat gtgattctat cccaaaaaag taaaataaaa 3720taaaattcaa aatacacgaa aagagtattt gctgaacaaa caagcctaaa gccctggatc 3780ccttccccca tgtcctaaga aaataagttt cttgaagctg gagggatggc tcagaggtta 3840agagccccag ctgcacttgc ggaacactaa gacccagttc ccagacccca cactgtgggt 3900cacaactgtc tcaaacgcca gctccggagg atccatgccc tctcctggcc tccaccggca 3960ccaagaacac atacagtgcc catacattta tgcaagcaag gtattcacgc acataaaact 4020aaaagaatat ttaataaaga tataacaaaa tagcatgaag cccagctggt acagaggttc 4080aaactacatc ccaggttcat ccctctgcct ttgctctcag ttggcttggg taggtctctt 4140ctctgaactg gcgccctgcg ggttccacat tgagaccctc tcatttttaa acctacttct 4200tctgggcggg gttaattgct gccagggctc aagccaacgc ttcctcttct ccacagcaat 4260cttccaagtt tcacgagata accaggaact gctaagttca tgtgaacctt agtgaagaac 4320ctgagtcttc ccatgtgatt ggtgtgtgca tgtgtgcata cacaaatgta tgtgtgtgct 4380ctatgtgtgc ctatgtatgt gtgcatgcat gtgtgcatat acaaatgcat atatgtctat 4440gtagtgtgcg tacacaaatg tatgtgtgtg ctcaatgtgt gcctatgtgt gtgtatgcat 4500gtgtgcgtac acaatgcatg tgtgtggtgt ctgtgtgcct gtgtgtgtat gcatgtatgc 4560atacacaaat gtatatgtgt ggtgtgtgaa tgtgtgccta tgtatgtgtg tgctgtgtgt 4620gggtgtggta tgtgtgtgat gtgtggaggg gtgtgtatgt gtggtatgta taggtgatac 4680gtttggggtg taatatgcgt atgtggtttg tgaaatgtag ttcgtgtgtg tgcatgtgtg 4740cgtgcgtgcg tgcgtgcgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtgtt ggatatagta 4800tgtgtgaggt gtgtgtactc accatggcct ccctcacttg ggggagtgaa gtcagcagcc 4860tggaccactc agggacatga gatactcaga cacatcttga tttccacccc tcttttcctg 4920atcctccttc acgtgtcact ttcccaaaca ctggacaaca gtttgggggc atctgattcc 4980actaatgaca gggacatcac atgtctccag agggaacacc ttctgtgtca catgtcatct 5040gagaatgtag cagagtcaca gagaaatgtc acagaaacca aaatgcagag taccaaggta 5100tagctaggca cagagcagag gggaagccgc tgaatttatt aaaaatgtca 
gaatcgtaaa 5160agacagggga cagcggtggg gacattcagg gtccagtagc acacaggcag tccaaacctg 5220atcactggaa ggtagtaggt aaggaaaggc tgcacacaga ttattcacac agtttataca 5280tgtacacaga ttattcacat ggtttgtgta tgtgcacaga ttattcacac agtttataca 5340tgtgtggctt cgtggtaact ttgagcttac tttcaattta aaaggatctc tctcacaagc 5400tggggccggg aatggctgca gtcaacactc catcacttag tcacactgtg caaacagcac 5460ctcctgactc atggtgactt gtagtaaaat gaagaggcca catttgcatc caagacagct 5520catcagtacc tagtgaagaa tctgtccctg agtatttgca tgaatggacc cgggtccagg 5580gcctggctgg gagtctccag gtgttgcagc cagaatgtca ttgtgttttt tcaggatccc 5640agaagtttct aaaatacagg ccaagtactc atttgtgtta caaagtatct gactaataga 5700agtgattagg taacacaaag ccttttaaaa accgagatca cccttgtcat gtccctggcc 5760tcttagaaca agatccaagc ttttgctggt tgacaagtgg ggccatccag tgcgtctccg 5820ttcctgctac ttcatctgga agacctctcc cactaacttg cccctgaccc ctcacacctg 5880ctgtttcctt tccacccgga agtgcttgtc taggctttca tggccatctg actgagcatc 5940taggcctcag tccagtggtc cctcagctct ctctagtcac tgtactaatg gaaacggcca 6000ctaactacat tttcaatatg gaagcctcct cctcaggaac ctccaagggc agaagcctcc 6060agagaaccac tcctgacccc ctggagttct gagtgcttct ggccctctct gtgtctgcag 6120gactattcac cacttgtgtt gaatggttca gtcctcacct cctctggcat gtgctcagtt 6180ctcatctcat tggggagtcc ttcccaggtc actcttctct cctgtctttg aagtgttttt 6240ttccttcatg gtatttctgt ctgggcacac acacagacac acatacacac acatacacac 6300ccatgcagta tggcagatac atcacctatg tttcagattt ttattctacc atcacccaat 6360acctgaatcc ccgaaaaagc cttagaaagc caggaatttg tgtatttttg tcagcactcc 6420accccagcac ctgaagccaa gcctgactta atatttttgg ttttgtttct aga        6473

-   <211> 7045
-   <212> DNA
-   <213> Cricetulus griseus
-   <400> 4

(SEQ ID NO: 55)

caattgatta tagatggatg atagatagat agatagatag atagatagat agatagatga 60tggatagaca gatgatggat agttagagga tagataatga ctgaataata agtacataaa 120tagatgatag agcggggcgt tggtggtgca cgtctttaac cccagcacca gagaggcaga 180ggcagttgga tctctgtgag tttgaggaca gcctggttac agaatgggtt ccaggacagc 240caaggctgtc actcagagaa atactgtctc aaataaaaaa agtaagtaaa caaataaata 300aatgataact agttagaaga tagatgattg aatgataggt agataaatag aagatagata 360gatagatgat tgatagatga tagacagata gacagacaga cagacagaca gacagcagaa 420agataatgca cggtgaaaca tggtctgatt tagttagcaa gatcagagaa gccttctttg 480aaagtgacat ttgagagcat ttcaaacgct gttcatgtca ggcatgccaa tggggagaga 540agggcttgca gaaagcaggc ccggcaagcc atggggagca agctaggagg cagcattcct 600tgcatttgcc tctgcctcag ctgcttcctg gagttccccg gtttttatca caacagtaga 660aataaaacca ggacaatgtt gtttccatgc atacatctgc aagaacttac tccggttcaa 720tagacagacc aaggcacctg tgtttgctca agaagcacgg agggaggtgt gtgcacctgc 780tgggtgctgg tgctctggct gtgccagaca gagagcaaga caggaaagtt cctggtggcc 840tagagcacac agcccagccc aggaagtcat gtctctctct gtctctgtct ctgccccacc 900cccaccccat ttaggccaga gaacagctgt ggcaagcttt gggtttgggt gagtcattcc 960tcaagagcca agagccgccc accttgtatg gggtagtttg ttgttgttgt tgttgttatt 1020atttgtttgt ttgtttgttt ggtaaaggtt tttcaatagg agttggaatt tggcaattca 1080gctaggctgg ctgagcagcc agctagcccc gggcactcat ccgtctctac ctccccagtt 1140ctgggatttc gggtacatgc tgccacatcc gacttttttc ccctgctcca gttcttaaga 1200ccaagtcttc atgtcaaaca cttcaccacc ttagccatct ttctgggtca gaagttagat 1260cttcaggaag acaaggagtg tatcaggaca tgagcgtgcc ccaactctgc tcagaccttc 1320tgatagagaa aatgggggga ggggtgtcag aggctgccgg agaaagacaa gtccaggtta 1380aggaggacga ccctgggctc tgaatccaag ggtgattccc tcaccttgta cacttggcat 1440tttgggaagg aagcatcaga taaaagcagt gcagacatag tcaggaatat ttacacgtgt 1500gagtcaacct gggagtgagt ctgtgtacaa ctgaacatga agcaagtttt gaagcttcat 1560ttccagacta ttcccagggt gcaataactt cctgttttcg ttgcagcctt cccagtctct 1620gccactgcca tctctacttc agtctggaat ggtgggcaca cagaaaaagt ctatggcaat 1680cctgcgagaa gacaagtggg cgcctgactt cgggctcctg ttacaagaga 
ggaatccagg 1740agtttatttt gcagctgatt cagtgttgac caagagtcca gctctggggg agtgggaagc 1800aaccaaagca gagacaggtc ccagcacaat ttttggtttt caagacagca cttctctgtg 1860gctttgaagg ctatcctaga actgttcttt gtatatcctt ccttgcaact agctcttata 1920gaccaggctg gtcttgaact cacagagatc catctgcctc tgcctcccaa gtgctgggat 1980taaaggcgtg cacctcggct gccaccaccc agctacatac ataatttaca ataataaaaa 2040taaaatactt taaagtgtta tagcagtttg aatgtaattg gccctgtcat ctcataggga 2100gtggcactat taggaggtat ggctttgttg aaggaaatat gtcactgtga gggtgggctg 2160tgaggtttcc tatgctcagg gtaccagcca gtgtctcagc tgaggtcctg ttgcctgcaa 2220gatgtaggac tctcatccct ttctccagca ccatgtctgt ctgcatgcca tcatgttccc 2280agccatgatg acaatgtact aaaacctctg aaactgccac ccaactaaat gttttccttt 2340ataagagttg ccatgctcat ggtgtctctt cacagcaata gaaaccctaa ctaagataag 2400tgtattctcc cctactcccc atgatttaaa atttaggaag gcaggtaggc aggcaggcag 2460gctggtatag tggttcattc tagcacctga gacctggaat gggaggattg tgagttagtt 2520ctaggccatt ctggtgccta gaaaccagag ccgggggttg gcccaatgca gagcacttgc 2580tctacgtatg gcccagcaca ataagtcaat ttcctcacct taaaggcttg acaatttaaa 2640aacactggtt tttagttagt ccgtgtctgc tccacagatg gagacagcta atcacagatg 2700catcaggggc cttcctgagt gctaaacatc aaacagcctt ctcccctcct gagcctttgt 2760gtgcagaatg tgtccatcgc aagaagcaaa cagtcttgct tgcccaccaa cttccttcct 2820gcatcagaag agctgggtgc aaactgcaag agtagcctca ccttagagat gggtcccatt 2880gctctacatg ggagcattac cttccaagaa ggcaaaaatg tctcctggtt gagctttttt 2940tgtcacctgt taaaggcaaa tcaacagaga ggctttgtct cacccactaa catcttggaa 3000acaaatacca acgaacgctg gggaggatgt ggggaaagca gagccctcat gctctccgag 3060ggaaaatcac acccactgtg gaacagtgtg gaaacctcaa agactgggat tacaagcagc 3120acacaagcca gccacgctac tcctggtcac acaccacaaa gacgcttgca cattcacgct 3180tacgctgcga acactagcaa cgttcccact gcctcctttg agccccgccc cccgcccctg 3240ccccccgccc cgcccctgtg gtctatgttc ctcttcccta aagtcagctt ccacttctct 3300gtctccatct tcgccccacc ctccctcctc gctacataat tgtctctatt ccatttctct 3360gctttgaaac agctttttgc aaagcatcaa atctattgtc ctatgcccca aatcaacctc 3420cagtttcaca agtgatacag 
gaaatcgttt tcctaattaa aaatcccccc tttgaccatt 3480tattcccact cttggaacat cttccccttg aggaaagtta cagaatgagg tggctctcct 3540cttcctattc gaggtgtttc cttcagactt tgtccgtgtc taatcttttt aactgttggc 3600caggcctcca ccacggcaca gatgaactgt ggggttcatt tacctgaaac tctatggaag 3660gatgtttatt tctccttcac tttagcaaat gataaagggc accattcact ctgtctattc 3720tgcaggggcc attcctttct ctaggccaga tactgagaat tgctcccaga atcaatgtgg 3780tatacatatt tccccttcaa cattgatagg cattgatcac acacacacac acacacacac 3840acacacacac acacagtagc acaaatgtat tcccctagcc cgcttccatc ttgccacagg 3900actccagagt ggccctggat agcaagcttc ctgttttgtt tctctgttcc tgctgctttt 3960ccaccctcca gtctatcttt tctaagtcct tctgccattg tcctcttccc aactgtcctg 4020agatgcagtc attgtctggg attcagacct tctctctctg cccaagtgag tatattgacc 4080cccacggttt gtacaaccat aacttcaggg agcccgacaa aaactgtttt atgagccaag 4140tagtcccagg acttgagagg tagaggcggg aagatcagca gtttgaggcc agcctggaga 4200gcataagagc cggtctcaaa acaacaatgg aaactagata ctaagtaaaa atcctggggt 4260gtttcatcat gaatgtctgt tcttctagta ccacgctgaa ctccgtacac agctccagct 4320gttacggctt tcttagaatc catactcttt tttttttttt tttttttttt ttttttttgg 4380tttttcgaga cagggtttct ctgtggcttt ggaggctgtc ctggaactag ctcttataga 4440ccaggctggt ctcgaactca cagagatcca cctgcctctg cctccagagt gctgggatta 4500aaggcgtgcg ccaccaacac ccggcagaat ccatactctt tttaaaaaaa gatttatcaa 4560tttactatgt atacagcttt ctgcctgcat gtatccatgc atgtcagaag atggcaccag 4620gtcgcattac agatggttgt gagccaccat gtggttgctg ggaattgaac tcagaatgtc 4680tagaagagca accagttctc ttaacctctg agccatctct ccggccccca gaaatccata 4740ttcttgagga ttttttacac cccccccacc aaaagacgta tatctaaatt ttaatgtgag 4800aattcacatt ttcttaagag ttgaacatag atttagagga aaatcagatc ccacatgatt 4860aacaaagcat gcttgtgggc aggtctgcta ccaagaggtg ggccgtagct tctagctcag 4920acaaactcac tcccttcctc gtggcctctt cgccctcaag tcagaaactc accctgtgat 4980tctgccccag aagttgctct agagcacagt gcatccttcc gtcttcactc tgtggcttga 5040attgtgtcca tcgcttatga ttacaacccc tcacagagca tcctaactgg tttctttgca 5100tgcctatggg cactcctcca ttctagaaca cccttgccat caatactatg 
aaaggagggg 5160tggaggagga agagcaggaa gaggaggggg aagcgaggga agaggaagac acggatggca 5220atgaggaggg gggagcaccc aagtcctccc tggatgagag tctcactggg agacttaata 5280ttaattataa atgcttggtc agcagctggg caggataagg ttaggcagga gaaccagact 5340aaggactctg ggaagcagaa gggcagagtc agacaaggag aggaaacagg aagtacaagg 5400taaagtcacg tggcagaatg tagataatag aaatgggttc atttaagttg gaagagttag 5460ctagtaacaa gcctgagcta tcagccgagc atttataatt aatattgagc ctccatattg 5520gttatctggg aattggcggg cagaaaaaaa aaagtctgcc tacaagtcaa tgtcatgtag 5580ctcccaaagc caaggtacct ttgttcagtg cttgactgag ccagcattat aaattttctc 5640cagatgtacc gaatcacatt tcatagcaac atgcagacat caagttttcc ctgaagctct 5700aaccagctgg ttgcatgctg tccggagtct cagctataac ccagaagtga cctgggtcgg 5760ggaagaggtg gtactttgcc ttctttgcac tctctgtgtt gcctcaccca ttcagcttca 5820agcaatgtga ctgcctgacc ctgagggcgt ttacaacgcc tgacccacag accacaagtc 5880aaccagctgg tgtgctcacg atacctagtc tgaaccatag ccctgctccc accctgcctc 5940catctccacc ctttcttcac tgctcatcac agctggctag caaagactgc ctcagacctg 6000agcacaggct ccactccaca gccgtgactg ttcgagccac ttaaatcaaa gagcgcttgt 6060cttccgctca gtaaatctct cctcagctca ctgatgacgt tgactttctc tagacagcac 6120atttgggttt aagacactgc tacttgagct cttcattcag ttcctcagaa tacctcattt 6180gggtcagatt cccaaagagg aagatagggt tcctggcaga cagacatgtc tcattccttt 6240gaaatccttc agagaaatgc agtgactatg gcaccttctt aaaaagcaca cacacaaata 6300acacacacac acacacacac acacacacac acacacacac atatccccct cactgtcatc 6360cttgatatgt atatgatata tataaaatca ttgttttata ctgtgataat tgattatgaa 6420taaaatttac taaaatgaac aattaaaatt atgggggggg ctggagagat ggctcatcag 6480ttaagagaac agttgctgct cttgcagaac acgagagttc agttcccagc acccacatca 6540ggcagctcat aaccatgtgt ggtgtcagtt ccaggagatc tggtgccctc ttctggcctc 6600ctccagcacc tgctacatgt ggttcacaca cacacacaca cacacacaca cacacacaca 6660cacacacaca caaataaata taaagattat ttttttcaaa actgagttaa aaataggttc 6720tatctgattc atactaaggc ttttcacagt ggttaagtct attagatatg tctagccata 6780tcctttctcc cttctttctt gaggagaggc ttttaaagct acaagttaca gccttctttg 6840caaataagag taccatttaa 
caggcctctg accaatgaga tgccagaatc ggttgcccag 6900gagcttccca aacagtccat tatagggaaa ggtggtacaa accagtagat taggcatgtt 6960ccacttccta agtgccgtgc caaataagga aatggcctca aatgtttgcc ttttatcttc 7020acccacctct gaattgcacg ctagt                                       7045

-   <211> 13515
-   <212> DNA
-   <213> Cricetulus griseus
-   <400> 5

(SEQ ID NO:56)

tctagaaaca aaaccaaaaa tattaagtca ggcttggctt caggtgctgg ggtggagtgc 60tgacaaaaat acacaaattc ctggctttct aaggcttttt cggggattca ggtattgggt 120gatggtagaa taaaaatctg aaacataggt gatgtatctg ccatactgca tgggtgtgta 180tgtgtgtgta tgtgtgtctg tgtgtgtgcc cagacagaaa taccatgaag gaaaaaaaca 240cttcaaagac aggagagaag agtgacctgg gaaggactcc ccaatgagat gagaactgag 300cacatgccag aggaggtgag gactgaacca ttcaacacaa gtggtgaata gtcctgcaga 360cacagagagg gccagaagca ctcagaactc cagggggtca ggagtggttc tctggaggct 420tctgcccttg gaggttcctg aggaggaggc ttccatattg aaaatgtagt tagtggccgt 480ttccattagt acagtgacta gagagagctg agggaccact ggactgaggc ctagatgctc 540agtcagatgg ccatgaaagc ctagacaagc acttccgggt ggaaaggaaa cagcaggtgt 600gaggggtcag gggcaagtta gtgggagagg tcttccagat gaagtagcag gaacggagac 660gcactggatg gccccacttg tcaaccagca aaagcttgga tcttgttcta agaggccagg 720gacatgacaa gggtgatctc ggtttttaaa aggctttgtg ttacctaatc acttctatta 780gtcagatact ttgtaacaca aatgagtact tggcctgtat tttagaaact tctgggatcc 840tgaaaaaaca caatgacatt ctggctgcaa cacctggaga ctcccagcca ggccctggac 900ccgggtccat tcatgcaaat actcagggac agattcttca ctaggtactg atgagctgtc 960ttggatgcaa atgtggcctc ttcattttac tacaagtcac catgagtcag gaggtgctgt 1020ttgcacagtg tgactaagtg atggagtgtt gactgcagcc attcccggcc ccagcttgtg 1080agagagatcc ttttaaattg aaagtaagct caaagttacc acgaagccac acatgtataa 1140actgtgtgaa taatctgtgc acatacacaa accatgtgaa taatctgtgt acatgtataa 1200actgtgtgaa taatctgtgt gcagcctttc cttacctact accttccagt gatcaggttt 1260ggactgcctg tgtgctactg gaccctgaat gtccccaccg ctgtcccctg tcttttacga 1320ttctgacatt tttaataaat tcagcggctt cccctctgct ctgtgcctag ctataccttg 1380gtactctgca ttttggtttc tgtgacattt ctctgtgact ctgctacatt ctcagatgac 1440atgtgacaca gaaggtgttc cctctggaga catgtgatgt ccctgtcatt agtggaatca 1500gatgccccca aactgttgtc cagtgtttgg gaaagtgaca cgtgaaggag gatcaggaaa 1560agaggggtgg aaatcaagat gtgtctgagt atctcatgtc cctgagtggt ccaggctgct 1620gacttcactc ccccaagtga gggaggccat ggtgagtaca cacacctcac acatactata 1680tccaacacac acacacacac acacacacac acgcacgcac gcacgcacgc 
acgcacacat 1740gcacacacac gaactacatt tcacaaacca catacgcata ttacacccca aacgtatcac 1800ctatacatac cacacataca cacccctcca cacatcacac acataccaca cccacacaca 1860gcacacacat acataggcac acattcacac accacacata tacatttgtg tatgcataca 1920tgcatacaca cacaggcaca cagacaccac acacatgcat tgtgtacgca cacatgcata 1980cacacacata ggcacacatt gagcacacac atacatttgt gtacgcacac tacatagaca 2040tatatgcatt tgtatatgca cacatgcatg cacacataca taggcacaca tagagcacac 2100acatacattt gtgtatgcac acatgcacac accaatcaca tgggaagact caggttcttc 2160actaaggttc acatgaactt agcagttcct ggttatctcg tgaaacttgg aagattgctg 2220tggagaagag gaagcgttgg cttgagccct ggcagcaatt aaccccgccc agaagaagta 2280ggtttaaaaa tgagagggtc tcaatgtgga acccgcaggg cgccagttca gagaagagac 2340ctacccaagc caactgagag caaaggcaga gggatgaacc tgggatgtag tttgaacctc 2400tgtaccagct gggcttcatg ctattttgtt atatctttat taaatattct tttagtttta 2460tgtgcgtgaa taccttgctt gcataaatgt atgggcactg tatgtgttct tggtgccggt 2520ggaggccagg agagggcatg gatcctccgg agctggcgtt tgagacagtt gtgacccaca 2580gtgtggggtc tgggaactgg gtcttagtgt tccgcaagtg cagctggggc tcttaacctc 2640tgagccatcc ctccagcttc aagaaactta ttttcttagg acatggggga agggatccag 2700ggctttaggc ttgtttgttc agcaaatact cttttcgtgt attttgaatt ttattttatt 2760ttactttttt gggatagaat cacattctgc agctcaggct gggcctgaac tcatcaaaat 2820cctcctgtct cagtctacca ggtgataaga ttactgatgt gagcctggct ttgacaagca 2880ctttagagtc cccagccctt ctggacactt gttccaagta taatatatat atatatatat 2940atatatatat atatatatat atatattgtg tgtgtgtgtt tgtgtgtgta tgagacactt 3000gctctaaggg tatcatatat atccttgatt tgcttttaat ttatttttta attaaaaatg 3060attagctaca tgtcacctgt atgcgtctgt atcatctata tatccttcct tccttctctc 3120tctttctctc ttcttcttct cacccccaag catctatttt caaatccttg tgccgaggag 3180atgccaagag tctcgttggg ggagatggtg agggggcgat acaggggaag agcaggagga 3240aagggggaca gactggtgtg ggtctttgga gagctcagga gaatagcagc gatcttccct 3300gtccctggtg tcacctctta cagccaacac cattttgtgg cctggcagaa gagttgtcaa 3360gctggtcgca ggtctgccac acaaccccaa tctggcccca agaaaaggca cctgtgtgtg 3420actctggggt taaaggcgct 
gcctggtcgt ctccagctgg acttgaaact cccgtttaat 3480aaagagttct gcaaaataat acccgcagag tcacagtgcc aggttcccgt gctttcctga 3540agcgccaggc acgggttccc taggaaatgg ggccttgctt gccaagctcc cacggcttgc 3600cctgcaaacg gcctgaatga tctggcactc tgcgttgcca ctgggatgaa atggaaaaaa 3660gaaaaagaag aagtgtctct ggaagcgggc gcgctcacac aaacccgcaa cgattgtgta 3720aacactctcc attgagaatc tggagtgcgg ttgccctcta ctggggagct gaagacagct 3780agtgggggcg gggggaggac cgtgctagca tccttccacg gtgctcgctg gctgtggtgc 3840atgccgggaa ccgaaacgcg gaactaaagt caagtcttgc tttggtggaa ctgacaatca 3900acgaaatcac ttcgattgtt ttcctctttt tactggaatt cttggatttg atagatgggg 3960gaggatcaga gggggagggg aggggcgggg agacggaggg aggaggggag gaggggagga 4020ggggaggagg ggaggagggg aagggatgga ggaaaatact aacttttcta attcaacatg 4080acaaagattc ggagaaagtg caccgctagt gaccgggagg aggaatgccc tattgggcat 4140tatattccct gtcgtctaat ggaatcaaac tcttggttcc agcaccaagg attctgagcc 4200tatcctattc aagacagtaa ctacagccca cacggaagag gctatacaac tgaagaaata 4260aaattttcac tttatttcat ttctgtgact gcatgttcac atgtagagag ccacctgtgt 4320ctaggggctg atgtgctggg cagtagagtt ctgagcccgt taactggaac aacccagaac 4380tcccaccaca gttagagctt gctgagagag ggaggccctt ggtgagattt ctttgtgtat 4440ttatttagag acagggtctc atactgtagt ccaagctagc ctccagctca cagaaattct 4500cctgttccgg tttccaaagt actggagtta tgagtgtgtg ttaattgaac gctaagaatt 4560tgctgattga agaaaacctc aagtgggttt ggctaatccc cacgacccca gaggctgagg 4620caggaggaat gagagaattc aaggtttgcc agagccacag ggtgagctca atgtggagac 4680tgtgagggtg agctcaatgt ggagactgtg agggtgagct caatgtggag actgtgaggg 4740tgagctcaat gtggagactg tgagggtgag ctcaatgtgg agactgtgag ggtgagctca 4800atgtggagac ctgtatcaag ataataatag tagtagtaac aatgcaggcg agggtgtggt 4860tgagtggtag agcagttagt tgatttgaca tgcttgaggt ctcccggtcc atctgtggcc 4920ctgcaacagg aagggaggga ggaagggggg gaacgagaga gaggaaagag agacagaagc 4980taagataggg aatgagagag gaaggaagaa acgggaagaa attcagactc cttcctgagt 5040tccgccaacg cctagtgaca tcctgtgcac accctaaggt ggcctttgtg tggcactggc 5100ttgggtggtc gggaaaggca ttttcagctt gttgcagaac tgccacagta 
gcatgctggg 5160tccgtgaaag tttctgcccg ttaacaagaa gtctctacta cttgtgacct caccagtgaa 5220aatttcttta attgtctcct ggtgttctgg gttttgcatt tttgtttcta aggatacatt 5280cctgggtgat gtcatgaagt ccccaaagac acagtggggc tgtgttggat tgggaaagat 5340gatttatctg gggtgtcaaa aggaaaagaa gggaaacagg cacttgggaa aatgtcctcc 5400cgcccacccg aattttggct tggcaaccgt ggtggaggag caagaaacac gtggacgttt 5460gaggaggcat ggggtcctag gaggacagga agcagaagga gagagctggg ctgacagcct 5520gcaggcattg cacagtttca gaaggagatt acagcatgac tgagttttta gggatccaac 5580agggacctgg gtagagattc tgtgggctct gaggcaactt gacctcagcc agatggtatt 5640tgaataacct gctcttagag ggaaaacaga catagcaaac agagccacgt ttagtgatga 5700aactctcact ttgcctgagt catgtgcggc catgcccagg ggtcaggctg acactcaact 5760caaaaacaag tgagaaattg aagacaatcc gtggtggcag ctactggaag ggccaccaca 5820tccccagaaa gagtggagct gctaaaaagc catttgtgat aggcacagtt atcttgaatg 5880catggagcag agattacgga aaaatcgaga atgttaatga ggcaacattc gagttgagtc 5940attcagtgtg ggaaacccag acgcttccat cccctaaaag gaacatcttg ctctcagtca 6000aaatggaaat aaaaattggg gcttgaattt ggcaaatgat tcagaactct gtgtaggtat 6060tttcacacgc acagtggata attttcatgt tggagtttat ttgtgctaaa aggcagaaaa 6120gggtaaaaag cacatcttaa gagttatgag gttctacgaa taaaaataat gttacttaca 6180gctattcctt aattagtacc cccttccacc tgtggtaatt tcctgagata gtcagtgggg 6240aaaagatctc tccttctctt ctttctcccc ctcccctcct ctccctccct ccctccctcc 6300ctccctcctc tccctccctc cccctttcct tctttctttg ctccttctcc tctgcctcct 6360tctccctttc ttcttcattt attctaagta gcttttaaca gcacaccaat tacctgtgta 6420taacgggaaa acacaggctc aagcagctta gagaagattg atctgtgttc actagcgtgc 6480aattcagagg tgggtgaaga taaaaggcaa acatttgagg ccatttcctt atttggcacg 6540gcacttagga agtggaacat gcctaatcta ctggtttgta ccacctttcc ctataatgga 6600ctgtttggga agctcctggg caaccgattc tggcatctca ttggtcagag gcctgttaaa 6660tggtactctt atttgcaaag aaggctgtaa cttgtagctt taaaagcctc tcctcaagaa 6720agaagggaga aaggatatgg ctagacatat ctaatagact taaccactgt gaaaagcctt 6780agtatgaatc agatagaacc tatttttaac tcagttttga aaaaaataat ctttatattt 6840atttgtgtgt gtgtgtgtgt 
gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gaaccacatg 6900tagcaggtgc tggaggaggc cagaagaggg caccagatct cctggaactg acaccacaca 6960tggttatgag ctgcctgatg tgggtgctgg gaactgaact ctcgtgttct gcaagagcag 7020caactgttct cttaactgat gagccatctc tccagccccc cccataattt taattgttca 7080ttttagtaaa ttttattcat aatcaattat cacagtataa aacaatgatt ttatatatat 7140catatacata tcaaggatga cagtgagggg gatatgtgtg tgtgtgtgtg tgtgtgtgtg 7200tgtgtgtgtg tgtgttattt gtgtgtgtgc tttttaagaa ggtgccatag tcactgcatt 7260tctctgaagg atttcaaagg aatgagacat gtctgtctgc caggaaccct atcttcctct 7320ttgggaatct gacccaaatg aggtattctg aggaactgaa tgaagagctc aagtagcagt 7380gtcttaaacc caaatgtgct gtctagagaa agtcaacgtc atcagtgagc tgaggagaga 7440tttactgagc ggaagacaag cgctctttga tttaagtggc tcgaacagtc acggctgtgg 7500agtggagcct gtgctcaggt ctgaggcagt ctttgctagc cagctgtgat gagcagtgaa 7560gaaagggtgg agatggaggc agggtgggag cagggctatg gttcagacta ggtatcgtga 7620gcacaccagc tggttgactt gtggtctgtg ggtcaggcgt tgtaaacgcc ctcagggtca 7680ggcagtcaca ttgcttgaag ctgaatgggt gaggcaacac agagagtgca aagaaggcaa 7740agtaccacct cttccccgac ccaggtcact tctgggttat agctgagact ccggacagca 7800tgcaaccagc tggttagagc ttcagggaaa acttgatgtc tgcatgttgc tatgaaatgt 7860gattcggtac atctggagaa aatttataat gctggctcag tcaagcactg aacaaaggta 7920ccttggcttt gggagctaca tgacattgac ttgtaggcag actttttttt ttctgcccgc 7980caattcccag ataaccaata tggaggctca atattaatta taaatgctcg gctgatagct 8040caggcttgtt actagctaac tcttccaact taaatgaacc catttctatt atctacattc 8100tgccacgtga ctttaccttg tacttcctgt ttcctctcct tgtctgactc tgcccttctg 8160cttcccagag tccttagtct ggttctcctg cctaacctta tcctgcccag ctgctgacca 8220agcatttata attaatatta agtctcccag tgagactctc atccagggag gacttgggtg 8280ctcccccctc ctcattgcca tccgtgtctt cctcttccct cgcttccccc tcctcttcct 8340gctcttcctc ctccacccct cctttcatag tattgatggc aagggtgttc tagaatggag 8400gagtgcccat aggcatgcaa agaaaccagt taggatgctc tgtgaggggt tgtaatcata 8460agcgatggac acaattcaag ccacagagtg aagacggaag gatgcactgt gctctagagc 8520aacttctggg gcagaatcac agggtgagtt tctgacttga gggcgaagag 
gccacgagga 8580agggagtgag tttgtctgag ctagaagcta cggcccacct cttggtagca gacctgccca 8640caagcatgct ttgttaatca tgtgggatct gattttcctc taaatctatg ttcaactctt 8700aagaaaatgt gaattctcac attaaaattt agatatacgt cttttggtgg ggggggtgta 8760aaaaatcctc aagaatatgg atttctgggg gccggagaga tggctcagag gttaagagaa 8820ctggttgctc ttctagacat tctgagttca attcccagca accacatggt ggctcacaac 8880catctgtaat gcgacctggt gccatcttct gacatgcatg gatacatgca ggcagaaagc 8940tgtatacata gtaaattgat aaatcttttt ttaaaaagag tatggattct gccgggtgtt 9000ggtggcgcac gcctttaatc ccagcactct ggaggcagag gcaggtggat ctctgtgagt 9060tcgagaccag cctggtctat aagagctagt tccaggacag cctccaaagc cacagagaaa 9120ccctgtctcg aaaaaccaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaga gtatggattc 9180taagaaagcc gtaacagctg gagctgtgta cggagttcag cgtggtacta gaagaacaga 9240cattcatgat gaaacacccc aggattttta cttagtatct agtttccatt gttgttttga 9300gaccggctct tatgctctcc aggctggcct caaactgctg atcttcccgc ctctacctct 9360caagtcctgg gactacttgg ctcataaaac agtttttgtc gggctccctg aagttatggt 9420tgtacaaacc gtgggggtca atatactcac ttgggcagag agagaaggtc tgaatcccag 9480acaatgactg catctcagga cagttgggaa gaggacaatg gcagaaggac ttagaaaaga 9540tagactggag ggtggaaaag cagcaggaac agagaaacaa aacaggaagc ttgctatcca 9600gggccactct ggagtcctgt ggcaagatgg aagcgggcta ggggaataca tttgtgctac 9660tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgat caatgcctat caatgttgaa 9720ggggaaatat gtataccaca ttgattctgg gagcaattct cagtatctgg cctagagaaa 9780ggaatggccc ctgcagaata gacagagtga atggtgccct ttatcatttg ctaaagtgaa 9840ggagaaataa acatccttcc atagagtttc aggtaaatga accccacagt tcatctgtgc 9900cgtggtggag gcctggccaa cagttaaaaa gattagacac ggacaaagtc tgaaggaaac 9960acctcgaata ggaagaggag agccacctca ttctgtaact ttcctcaagg ggaagatgtt 10020ccaagagtgg gaataaatgg tcaaaggggg gatttttaat taggaaaacg atttcctgta 10080tcacttgtga aactggaggt tgatttgggg cataggacaa tagatttgat gctttgcaaa 10140aagctgtttc aaagcagaga aatggaatag agacaattat gtagcgagga gggagggtgg 10200ggcgaagatg gagacagaga agtggaagct gactttaggg aagaggaaca tagaccacag 10260gggcggggcg 
gggggcaggg gcggggggcg gggctcaaag gaggcagtgg gaacgttgct 10320agtgttcgca gcgtaagcgt gaatgtgcaa gcgtctttgt ggtgtgtgac caggagtagc 10380gtggctggct tgtgtgctgc ttgtaatccc agtctttgag gtttccacac tgttccacag 10440tgggtgtgat tttccctcgg agagcatgag ggctctgctt tccccacatc ctccccagcg 10500ttcgttggta tttgtttcca agatgttagt gggtgagaca aagcctctct gttgatttgc 10560ctttaacagg tgacaaaaaa agctcaacca ggagacattt ttgccttctt ggaaggtaat 10620gctcccatgt agagcaatgg gacccatctc taaggtgagg ctactcttgc agtttgcacc 10680cagctcttct gatgcaggaa ggaagttggt gggcaagcaa gactgtttgc ttcttgcgat 10740ggacacattc tgcacacaaa ggctcaggag gggagaaggc tgtttgatgt ttagcactca 10800ggaaggcccc tgatgcatct gtgattagct gtctccatct gtggagcaga cacggactaa 10860ctaaaaacca gtgtttttaa attgtcaagc ctttaaggtg aggaaattga cttattgtgc 10920tgggccatac gtagagcaag tgctctgcat tgggccaacc cccggctctg gtttctaggc 10980accagaatgg cctagaacta actcacaatc ctcccattcc aggtctcagg tgctagaatg 11040aaccactata ccagcctgcc tgcctgccta cctgccttcc taaattttaa atcatgggga 11100gtaggggaga atacacttat cttagttagg gtttctattg ctgtgaagag acaccatgag 11160catggcaact cttataaagg aaaacattta gttgggtggc agtttcagag gttttagtac 11220attgtcatca tggctgggaa catgatggca tgcagacaga catggtgctg gagaaaggga 11280tgagagtcct acatcttgca ggcaacagga cctcagctga gacactggct ggtaccctga 11340gcataggaaa cctcacagcc caccctcaca gtgacatatt tccttcaaca aagccatacc 11400tcctaatagt gccactccct atgagatgac agggccaatt acattcaaac tgctataaca 11460ctttaaagta ttttattttt attattgtaa attatgtatg tagctgggtg gtggcagccg 11520aggtgcacgc ctttaatccc agcacttggg aggcagaggc agatggatct ctgtgagttc 11580aagaccagcc tggtctataa gagctagttg caaggaagga tatacaaaga acagttctag 11640gatagccttc aaagccacag agaagtgctg tcttgaaaac caaaaattgt gctgggacct 11700gtctctgctt tggttgcttc ccactccccc agagctggac tcttggtcaa cactgaatca 11760gctgcaaaat aaactcctgg attcctctct tgtaacagga gcccgaagtc aggcgcccac 11820ttgtcttctc gcaggattgc catagacttt ttctgtgtgc ccaccattcc agactgaagt 11880agagatggca gtggcagaga ctgggaaggc tgcaacgaaa acaggaagtt attgcaccct 11940gggaatagtc tggaaatgaa 
gcttcaaaac ttgcttcatg ttcagttgta cacagactca 12000ctcccaggtt gactcacacg tgtaaatatt cctgactatg tctgcactgc ttttatctga 12060tgcttccttc ccaaaatgcc aagtgtacaa ggtgagggaa tcacccttgg attcagagcc 12120cagggtcgtc ctccttaacc tggacttgtc tttctccggc agcctctgac acccctcccc 12180ccattttctc tatcagaagg tctgagcaga gttggggcac gctcatgtcc tgatacactc 12240cttgtcttcc tgaagatcta acttctgacc cagaaagatg gctaaggtgg tgaagtgttt 12300gacatgaaga cttggtctta agaactggag caggggaaaa aagtcggatg tggcagcatg 12360tacccgaaat cccagaactg gggaggtaga gacggatgag tgcccggggc tagctggctg 12420ctcagccagc ctagctgaat tgccaaattc caactcctat tgaaaaacct ttaccaaaca 12480aacaaacaaa caaataataa caacaacaac aacaacaaac taccccatac aaggtgggcg 12540gctcttggct cttgaggaat gactcaccca aacccaaagc ttgccacagc tgttctctgg 12600cctaaatggg gtgggggtgg ggcagagaca gagacagaga gagacatgac ttcctgggct 12660gggctgtgtg ctctaggcca ccaggaactt tcctgtcttg ctctctgtct ggcacagcca 12720gagcaccagc acccagcagg tgcacacacc tccctccgtg cttcttgagc aaacacaggt 12780gccttggtct gtctattgaa ccggagtaag ttcttgcaga tgtatgcatg gaaacaacat 12840tgtcctggtt ttatttctac tgttgtgata aaaaccgggg aactccagga agcagctgag 12900gcagaggcaa atgcaaggaa tgctgcctcc tagcttgctc cccatggctt gccgggcctg 12960ctttctgcaa gcccttctct ccccattggc atgcctgaca tgaacagcgt ttgaaatgct 13020ctcaaatgtc actttcaaag aaggcttctc tgatcttgct aactaaatca gaccatgttt 13080caccgtgcat tatctttctg ctgtctgtct gtctgtctgt ctgtctatct gtctatcatc 13140tatcaatcat ctatctatct atcttctatt tatctaccta tcattcaatc atctatcttc 13200taactagtta tcatttattt atttgtttac ttactttttt tatttgagac agtatttctc 13260tgagtgacag ccttggctgt cctggaaccc attctgtaac caggctgtcc tcaaactcac 13320agagatccaa ctgcctctgc ctctctggtg ctggggttaa agacgtgcac caccaacgcc 13380ccgctctatc atctatttat gtacttatta ttcagtcatt atctatcctc taactatcca 13440tcatctgtct atccatcatc tatctatcta tctatctatc tatctatcta tctatcatcc 13500atctataatc aattg                                                  13515

-   <211> 14553
-   <212> DNA
-   <213> Mus musculus
-   <400> 6

(SEQ ID NO:57)

cttgaagaac acatgttttc caagagggag cacccatgtt ggaatgacaa tgtagttagt 60gctcctctcc tgtaggttag tgctcctttg ctataggtaa gtgctcctct cctataggtc 120agtgctcctc tcctataggt tagtgctcct ctcctatagg ttagtgctcc tctcctacag 180gttagtgctc ctctgctcta ggttagtcct gctctcctat agtacctaga gagctagggc 240aaatgggcta ggcccgaagt gcagagacaa acagctatgg aagactgggt aagcacttcc 300aagctacgaa agagcagtgt gaagggtcag ggcttgtgca gttagtaggg gagatcttcc 360agttgaagaa acagaagaac tgagagccac tgggtatcat cctcctgcgc catgccttcc 420tggatactgc catgctccca ccttgatgat aatggaatga acctctgaac ctgtaagcca 480gccccaatga aatattgttt ttatgagagt tgccttggtc atgctgtctg ttcacagcag 540taaaacccta aataaggcag aagttggtac cagtattgct gtgatagacc tgaccatgct 600ttcctttgaa agaatgtgga tttggtgact ttggatttgc aacacagtgg aatgctttaa 660atggagatta atgggtcatc aattcctagt aggaatatgg aagactttgt tgctgggagt 720atttgaactg tgttgacctg gcctaagaga tttcaaagga gaagaatttc agaatgtggc 780ataaagacag tttttgtggt attttggtga agaatgtggc tactttttgc ccttgtctga 840aaagtctgcc tgagactaaa gtgaagagaa tcagattaat tgcattgaca agggaagttt 900gtggctgcgc tatctggaaa cttacagcca gcctcttgga cctcgggtga cttacgcaaa 960tactcaggga cagagatgct tgactctgta ctgatgagtt gtcttggatg caaatatggg 1020ctcttcattt gactacatgt cacgatgagt caggagctgc tctctccaga gtgtgacaaa 1080gcgaggggat gctgacggta gctgttctag ctttgaaggt aagcctgcac ttatgctaaa 1140gtcacacata cacgagccgg gtggagaacc tgtctgtgtg gagacacctt tcattacctg 1200tggcatccag cctctcaagc ttggactgcc tgtgtgctcc tggactctgg aggtcccact 1260gctctgtcct ctgctgctta tgatactgac attttaaaag aatccagtgg ttcccccctg 1320tactcggtgt ctacttctac ctggatgttc ctcatttatg ttctgtgaca cttctctgtg 1380actctgctgc attcctgggt gacatgtgga caccctgtcc ctttgcagac catgatgtca 1440ctgtcactag tggaatcaga tgccccaagt gttgtcctgt gtttgggaac gtgacaggca 1500gtacagaagc agaagaggaa gggtgaaaac ggaaatgtca cagcagcatc tgatgtgtgc 1560ctcagtcacg catgctgctg attggaacta ctcagcatga gagagggcca tggtgaatac 1620acaaccctat acacactgtg tccatttctc tctctctctt acacagagag agagggagga 1680gggggagggg gaggcggagg gggaggggga gggagaggga gtgggagagg 
gagagggaga 1740gggagaggga gagggagagg gagagggaga gggagagttt aatgtctgtg aagagatacc 1800atgaccaaag caactcttat aaaggacaac atttaattgg ggctggctta caggttcaga 1860aattcagtcc attctcacca tggtgggaag catgcaggta gatgtggtgc tggaggaacc 1920aagagttcta tatcctgatc tgaaggcagc caggagaaga ctgcctcttc tgcacagggc 1980agagcttgag catagaacat caaagccctt ccccacactt cctccaacaa ggtcatacat 2040acttcaacaa agacacacct cctaacggtg ccactccctg tggaccaacc atttaaacgc 2100atgagtctat gagggtcaaa gctcttcaaa ccaccacact catgtacaca cacacacaca 2160cacacacaca ctctcataca cacacacaca cacactcaca cacacacaca cacacacaca 2220cacacacaca ccacacacac acacacacac agagttctat tttgcactgt ttcactgtca 2280caaggttcta cttatctcag acacactgcc aggaattgtg tgggaagact ttcagtttct 2340ttgggttcac atggacttag cagttcttgg tgatcctgaa agatttctgc agaaagaagc 2400caaagtgttg agcccaaggc ctggccacac attagtcctg tctagatgaa caggggttta 2460aaaataaggg ggcatcaagg tgaagccagc aggggctgac ttagagagga gacccaccca 2520agccaactgc tcgaagtcaa aagcgatgaa tccccatatc cagctgtgcc cggtgctgtc 2580ttgctacatc tttagtaaat gttcttttag ttgtatgcgt atgaatattt tgcttgcata 2640tatttgtgta caccataggt gttcctaggg cctatggagg ccagaagagg gcatcagatc 2700ctttggaact ggaattatag acacttgtta cccatagagt agattgtggg aaatgagcct 2760ttagtcttcg agagcggcca gtgctcttaa cctttggtcg tttctccagg tctttgagac 2820tttattttct tggacatcag gacaggatcc agggctttga gcttgtttct tcagccagct 2880ttcttttcat gtatattaaa ttttatgtta ttttgctttc tttttcccca agacagaatc 2940acactctata tagctcaggc tgggtttgaa ttcagtttcc ctgtctcagt ctaccgggta 3000atatgattac agatgtgagt ctgactttgg tatcaaagtc cccagccctt ctggatatgt 3060gttttaagga tatcagatat atccttgatt tgctttgaat tttcttttta gttacaacat 3120aattagttcc gtgtcacctg aatatgtgta tgtcacctac atagtcttcc ttcttctctt 3180cttccctctc ccaccttccc aggtacctgt ctgtcttcat atccttgtgc tgagagtctt 3240gttgagggag atgatgaccg agacagagcc actggggaag ggagatgggc tagtgcaggt 3300cttcagagag gagctcgtga atattgtagc ccctttagtc cctggcatgt cctcttgtat 3360agccaccgcc atgctgtggc ctggcagaag tgaataagtt gtccagctgt tgacaggcct 3420gccctccaga cccagtctga 
tcccaagaaa gggcatctgt gtctgtctct gaggccgtaa 3480gtgctgcctg gttgtctcca gcttgacttg acactccctc cttaataaga gtaccacaga 3540acagggtctg cagagtccct gggccaggtc cctgtgctgt cctggaatgc caggcgtgaa 3600tttcctgtga agtaggactt tgctcgccaa gctcccacgg cttgcccttc agatagccag 3660aattatctgg taccctgcat tgccgttcaa tacgcagagt atcactggaa gcgcgcgcgc 3720gcacacacac acacacacac acacacacac acacacacac acacgcccac tccatcttta 3780aaccccaccc cccagcaacg gcggtgtaaa cactctccat caggaagctg aaacgcagtt 3840gccctctgct ggggagatga aggcagcttg ctgggggcga ggaccgtgct agcaaccttc 3900cctggtgcac acgggctctg gtgcatgacg ggaacggaaa cgcggaacta aagtcagtcc 3960tgcttttttt tttttttttt tttttttttt tttttttttt tttttttttt ggcgttggtg 4020gtggactgag tgacaatcag tgaaatcact taggttgttt ttctcttctt cgttgggttt 4080gatagacggt gggagagggt cagaggagaa ggggagggat ggggagagag ggaggaggga 4140ggggcgggag gcggggggcg aggaaaacgt gctaacttct ccaatcctac aagacaaagg 4200tttggagaaa gccgcactga gtgacccagc agaaggaatc caggaatgtc cgctggaatc 4260tgactgttga ttccagcgcc atgcagagaa tctaggctgg taggaacatt ctttgtccta 4320tccgacataa taactccaac caacacggaa aagaaaggct atacaagtga agaaatggca 4380ttttcacttt catgactata caatcacttc caggtagtaa cacgtgtcta gcacagcggt 4440tctcaacctg ggggtcacga tcccccactt ttctgcatat cagacatttt tacgttgtta 4500ttcataacag tagcaaaatt gcagctatga agtaacaatg aaatgcattt atggtgcgtg 4560tgtgtgtgtg tgggggggta tcaccttaac atttactgta agaaggttga gaatactgct 4620ccagcagcta gtgtgttgga cttaggttct gggtatatta ttagcaatag ccaaccagaa 4680tccccaccca ccacagcatt gaggccccat gcagggcttg ctgggagagg cactgataag 4740acttctttat gtatttattt agagacgaat actcattagg taggccaagc tagcgtcaaa 4800ctcatggcaa ttctcctcct ccagtttcct aagtactgga ctcaggagtg tgttgccatc 4860atatacagta aggatttatt gactgaagaa aatctcaagt ggctttggtt aatccctact 4920acgccagagg ctgaggcagg aggcgcgcaa ggtcaaggct tgcctgggct acatatagag 4980tgagctcaat tttgacactt ggtgcggtgt tagtagtaat agtaaagatg aaggtgtggc 5040tcaggtgggg ccggtgattg gacacacttg gggtctcctg gtccatctgc agctgtgcaa 5100caggaagagc ggagaatgag aggaaagaga gaaaagacag aatgagagag 
agggaggaag 5160agagaaaaag gaaaagagag aggaaaggaa aaaggaaaat gaggaaagcg agaaagaaga 5220aatgagaaag aggaaaggga gaaagaaatg agagagagaa aagaaaagac agaatgcgag 5280agagggagga agagagaaaa aggaaaagag agaggaaagg aaaaaggaaa atgaggaaag 5340cgagaaagaa gaaatgagaa agaggaaagg gagaaagaaa tgagagagag aaaagaaaag 5400acagaatgcg agagagggag gaagagagaa aaaggaaaag agagaggaag ggaaaaagga 5460aaatgaggaa agcgagaaag aagaaatgag aaagaggaaa gggagaaaga aatgagagag 5520agaaaagaaa agacagaatg cgagagaggg aggaagagag aaaaaggaaa agagagagga 5580agggaaaaag gaaaatgagg aaagcgagaa agaagaaatg agaaagagga aagggagaaa 5640gaaatgagag agagaaaaga aaagacagaa tgcgagagag ggaggaagag agaaaaagga 5700aaagagagag gaagggaaaa tggaaaatga ggaaagcgag aaagaagaaa tgagaaagag 5760gaaagggaga aagaaatgag cgagataaaa gacagaattt gagagaggga ggaagaaata 5820ggaaaagaga ggaaaggatg gagaaaagag agaaagaaag agagatgaaa gagagaaagg 5880agaaatgaaa tgagagagag agagagacac aaagagccag agagagaaga aaaaagggga 5940aagagaaaga gaaagaggaa ggctcctctt ggacacatct tcctttatct ttccctgggg 6000accgccaaag cctggtggca tactgtacat tctgtacact gttcattcaa aacaggctct 6060gtcttaaaga tggtctgagc ggtcagaaaa gggtattgtt aacttgtttg caaaactgcc 6120tcaggagagt gctgagtgcg tgaaagttgc tgcccgttaa ggagaagtct ctactacttg 6180tgatctcacc atcgaaaatt tctttaattg tctcctggtg ttctgggttt tgcagttttg 6240tttctaagga tacattcttg ggtgatgtca caaagtcccc aaagacacgg tggagctgtg 6300ttagatgggg aaagacagtc tgctgaggat ttatctggaa ctgtcagaag gaaaagaagg 6360taaatggggc acttgggaaa gtggcctcta gtttgacttc tggcttagca aaggttgtgg 6420ggagataagg catacacagt agttagcagg aggcaacagg gtcctgggag gacgcgaggc 6480agaaggagag gctgggctga cagcatgcaa tcattgcata gtctccaaag gagattgcaa 6540catggctgag ttttcagagg tcctacagag cccgtggtag agattctgtg ggttctgaga 6600caacttgact ttagccagat ggtatttgag taatctggga gagagaaaac agctacagca 6660aacagggcca catttagtga cgaaactctc actttgactg ttgagtcatt tgcagtgggc 6720cctgaggtca ggctggccct cagctcaaaa acaagcgagg aactgaagca attactcaga 6780taatccacag ccacagccac tggaaagggc cacatcccca gagacagcac agcaggggtg 6840ggggtggggc tatgagaaag 
ttagtgattg tagcagttat ctagaatgtg cggagcagag 6900gaggttacac aaaaacctag aatgtcattc aatgtgggaa accgagaggc tcccaagccc 6960taaaaggaac agtttgcttt cagccaaaat ggaaataaaa tttggggctt aaatctggca 7020aatgattcag accttctgtg taggtgtctt taaatgcaca gcagattgat tttcatgttg 7080gagtttattt gaactaaaag acagaaatgg tgaaaagcac acctgaagaa attgagatgc 7140tatgaataaa atcatttact tacagctatc acttaattag tacctccttc caccttgctg 7200atttattggg ctagtcaagg aagaaaagat cttccctcct ccttctctcc tcctccccct 7260cctctcctcc tcccctcccc tccttgacct tcctctcctc cttttccctc ctccccctct 7320tcttctcttc accccctcct cccctcccct cctctgtact cctccccttt cctcccaatc 7380tcttttttct cccccttctt ctctttctcc cccctcctct tccctcctct tcctccctcc 7440ctccctcctc ctcctcatcc tcctcttcct cttcatcctc ttctccttcc tccctctcct 7500cctcctcctt ttccagccct acctaccttc cctttcttct tcatttattc aaagtagctt 7560tgaacagcac tactcggttt agttgtgtat aaaaggaaaa tgcaggtcca agcagcttgg 7620ggaagattgc tttttgctct ctggaggcag atgatgacag ttcaagatca ttccttttgc 7680tccatgtcac aggaaggggg acatgccgaa tctaccagtt tgcagccacc tacacaggat 7740ccaccttcac ttctaaggaa atgtttggga agctacctac caaccacttc tggcatctca 7800tgggctagag gactcttaaa tggcactctt atttgtttaa taaaggaggt tgtgacgtgt 7860agttttaaat cccttccaca caacaattgc tactctctga ccaaaaaaga agggagacag 7920gatacggcta ggtgtctagt agactttacc actttgaaaa gccttaatat aaatcaggta 7980gatacatctt tttaacttat tcttgtaaag acaaaaacaa aactttattt ttatttgtgt 8040gtatgcttgt gtgtgtgtgc ctgtgtgtat accacatgtc gctggtgccg gagaacacca 8100gaagagggga cctgatctcc tggagctaaa gctatccatg gttctgagct gcctgatgtg 8160ggtgctggga acagaactct ggtcttctgc aagagcaaca agcctcctct taactacgaa 8220tctcctcccc atccccccaa atacatttaa ttattcattt tagcagcttt atttcgtaac 8280tacttatcac agcataaaac aaggatttta tatatattac atgcaatcga ggataagagt 8340tgaggggaga tgcgtgtgct ccttctgggt gtctgtgctt ttgaagaatg taagcagtgc 8400acaagggacc gaggcgtgcc tgtctgccag gagctgtctt cttcccttgg actctgagct 8460gagtgcagtg ctccgaagaa gtaaaagacg acctcatgaa gcaatgtctt caacccaaac 8520atgctgtcca gacaaagtcc agcttcatta gtgctctgag gagagactta 
ctgagcctca 8580ggaaagcccc cctcagcatg gcgaaagtcc actttgattg aagtgactcg aaagccatgg 8640cagtgcggcg gcggccgcgt ggagcttgtg ctcgagtcgg aagcggcatc tttgtcaggc 8700ggctgtgatt agcacgggga ggcaggactg gagtgaagga agagttgggg gcggggctta 8760gcgctctggt ctcctaagct gtagtcagcg cctcaagatt tgtaacctgc cttctgcctt 8820cccagccagg cagtcaagtg gctccaagct gaagactgca aagtgcccct aaccttttgg 8880ttatagcgag gctgaagaca ccgtgctctt tcatgaaagc cggatgtctg aaatccgatt 8940tgataaatat ggataaaacg tataacgctc gatcaatcga atcgaaggag ctcacgattg 9000gcaccacggc tttggggaca acagagtact gactcgttgg gaggacttgg atacttcccc 9060tcctcttcca tctcttcccc tttcctcact tcctcctcct tccttctcca ttttctccct 9120cttcactgtt tcttactatt tttacaaaag attttattta tttatttatt tatttattta 9180tttatttatt tatttattta tttatttaat gtatgcgagt acactgtagc tgtcttcaga 9240cacaccagaa gagggcgtca agttccatta gagatggttt cgagccacca tgtggttgct 9300ggggcctctg gaaggaccgc cagtgctctt aacccctgag ccatttctcc agtacccttc 9360tcaccgtttc tcttcaatct tcttcctctt ccttctccac tttccttgtc ttcttggttt 9420cattatcttt ctccctttct tcctcttctc cccttcttcc tcctccactg tagttttcct 9480tccctactct tttcctgcct ccctcctcct cccctctcat tccccctcct ctttcctcct 9540tctccctcct cctccttcct tctccctctc ccctctcccc tctcccttct cccttctccc 9600cctcctcttc ctctttctcc ttctccaccc ctcctgtcac agtatcaatg gcaagggtgt 9660tctagaatgg aggagtgtcc cctaggcact aacgaaagcc agttaggatg ctctgagacg 9720ggtacaattc agggagggcc gtggggatgg aagggttgtg ctgcgattca ttctggagca 9780acccccaggc agaatcatga ggttggttcc ggattcgcag ggcacaattc agaagaggaa 9840ggtttcagga aggacgagtt tgtctgagat aggagttaca tctgatgtct tggcagcaga 9900gccactgtac aagcgtgctt tattaaccac gtgggattaa atcttctttt aaatttattt 9960tcaactctta aggaaacgtg aactttcaca ttcaaattta gacttgcagc tcttatgggg 10020aaaaaaaggg gatcttaaga atattaagca taggcggctg gagagatggc tcagcggtta 10080agagcactct ctgctctccc agaggtcctg agttcaattc ctagcaacca cataatagtt 10140aacaacagtc tttaatgaat tctaatgccc tcttctggtg tgtctgaaga cagttacagt 10200gtactcatat aaataaaata aagaaattta aaaaaatgaa tattaggcat agattcctgg 10260atcctaagaa 
agccatcaga gctggagcca tgtgtgggat cctgcttggt gctggagggg 10320cagagttcat gcccccgggg tttttactta ttatcacatt ttcatcgttg ttttgaaaca 10380gggtcttgtg tggtccaggc tggccttgaa ctcatctttc agcctctacc tcacaggttc 10440tgggattact tggttcctaa aagtatctcc gtcaagctcc ctggtgttat ggctgtgcca 10500accaggaggg tctatacact cgctcaggta gagggagaag atccgaatct ctgacaggga 10560ctgctgcctc tcggggcaaa tggagtgaag gacagcggca gaaggattta ggaaagatgg 10620acgggagagt ggaaatgctg cagaagccag aaaacaaagc aggaagcctg ctgtccagtg 10680gggctcaaga gcggagggat gcgagggggc tgcgcaggaa catttagcgt ctgcgtctat 10740gggggtaggg gcggggtgcc agcacctagt cacctgaagg ggaaatgctt gcccagggag 10800caggtctcag tagctgacct agagaaagga gcggccccta cagaggagac acgggtcact 10860gtttgttaaa gtgaaggaga aataaatatt ctttcaaaga atcttaggtg agcccagttc 10920atctgcgctg tggaggcctg gggaacagtt aaaaagaccc tgacacacac ccaaggcaaa 10980caagcaacac acggctcctt ccgtaagggt ccatgattct ctgaagaatc agccccggaa 11040tcagccccgg aatcaggtag tccgtaaaca caatgagtgt tttactctgc agaagtccag 11100cctgctggcg tctcccatta ccaaaataga gggatagtca cgtgagctca ccggctcgat 11160ttaaggcacg tggttttcca gggtagatga gctttggctt ctggaaccat tatggggcac 11220gaaggatgga gccaggattt tttttttttt tttttttttc tattagcaat tgatttgctt 11280gggcttggct ggacttgccc agttcttagg cccagtcttc ttaactgccg atctgaagtc 11340tgtcatggag tcagcctagc cttctcactt cccttcagct cgaataggaa gaggaggtgc 11400acaccagatg gtctgagagc agggataaat ggtgtgcctt tgtctttcag tatttcgtta 11460ttttaagtag gaagatgctt ttctgtatta cattgcttgt gaaaccggaa gttgattcgg 11520ggcacaggac aatggatttg gtgttttgca aggactgttt cagaagagag aggagtggaa 11580gggtggttag agtgaggagt ggggtgggac gggatggggg aagagaagga agggccagac 11640aggctaggta gggctgagag gaggcggtgg gaacttcttg agttagcgca gcagtaaact 11700tggatgtgcg tgtatctttg tgatatatga cccggagccg tgtagctggc tccgatagta 11760ctgctaatgt cagtgtcggg gggggggggt cccatactgt tccacagggg ctgcacattc 11820ccatcgagag caggagggct cctctctcca tacatcctcg ccagcattcc ttgttgtttc 11880tgtgatgaca gggggtggga tgaaatctct ctgttggttt gagagaccgt gaagaagctc 11940aaccccagga cattttgcag 
tcttggaagg cagtgcctcc atgtggagcc gtggagccca 12000tctctgagtc caggtcactc ttgcagttcg cactcagctc ttcagatgca ggagagacgt 12060tggtgggaaa gcaagattgt ttgcttgttg agatagacac attctccaca caaaggctca 12120cgtggggcaa aggctgattg acgtacagcg ttcaggaacg cctgtggtag agctatgatt 12180agctgtctcc atctatgaag cagacaaaga gttataaaaa aaatcaatgt tttcaaattg 12240tcaaactttt aacccgacag caagcgctct gtccctgggc taatccctag ccctggtttc 12300ttgagatggg gtcttttgtg cactagactg gcctagaact cacgatctta gtgttccagc 12360ctcccagctg ctgggatgag ccgctataac cagtctgcct gccttcctaa attttaagtg 12420atgggaagtg ggggagaata cagtttaaag tatgcagatc tgagagcagg aacctggcaa 12480agccaagggg ccggagttac aggcggctaa catgggtgct gggaactgac ccaggtcctt 12540gagaggagca gtgtgtactc ttgaccaaac aggtccgtct ctccagtccc cgtagtatta 12600aaaataggta ctacgggcat ggtggtgcac acctttaatc ccagcactag ggaggcagag 12660gcaggtggat ttctgagttt gaggccagcc tggtctacaa aatgagttcc aggacagcca 12720cggctataca gagaaaccct gtcttgaaaa caaaacaaca acaaaatagg tactacaaag 12780cgatgtaatt gtgctcaaac atgcaaaccg aggggactgt atgcataaga aagagaaaga 12840cggccacact ggttctatct gggtgacagg aaatcagtat ttttattttt cacattcatt 12900tttttgttgt tgttgttgac acagtgattt ttctatcaaa aacattattt cttttatagt 12960tcccctgagg agctgttttt aaagccgtgc tttgaaaaac cattgaagga gcagaggcag 13020ggagactcct gtgtggcagt cggtgaagca ggccctctgc aggcaggctg gccctggact 13080tgggagtctc tttccctccc tcctgtgctc aaatagcaaa tgtcaggctt caatgtagct 13140agaaggttct agaatgatta agtttccaag gctgaagagc ttccctgttt gcctttcact 13200tccctggaga ggtcgttgtg tgttccggag tctgcaaggt gcctttggtg atgcgggtgg 13260ttcatctcgg gagattccgc ctggaggacc caagttcaag ccctgcctga gctacagagt 13320gactttcagg tcttctgcgc aattcagtga gacccagtct acaaataaaa agtaaaaaga 13380aggctgtgga tggaactcgg tggtagagtt ctgggtttac tccctagagg aggggagaag 13440gaggaggagg gaggaggaag aggaagaaag aagaagagaa gggaagagga gaaggaaggg 13500agggaagggg ctgacaagaa gagagaagag ggagggaggg gagggaaagg aaggggaaag 13560gaagggaggg aaggggctga caagaagaga gaagagggag ggaggggagg gaaaggaagg 13620ggaaagaaga gaagggtaag aagaaactgt 
tccaatggtc tgggccacag agtgatggcc 13680ttttgtggtg atcagctgta atccttgatt tgacacaacc tagaatctgg gaagcgagtt 13740tctgtgaagg agcattcaca ctggctggcc tgtgggcgtg catgtgggag actgtcataa 13800ttaggttcat taatacagga agtcccagcc cactacaaat ggcttcgttc catacccaag 13860agatgctaac tgtagacggt tggagaaagc aagcaagctg tggatacccc acgctctttc 13920acctcggctc ctggggggtg ggtgcactgt gtctcttggt attttaaagt cctgccttga 13980cgtccctgct gtgacagact gtaactggaa ttgtgagctt tagtccttta gttttctacg 14040ttggtttttc tcaggatatt ttatcgcagt aacagaaaca agaccaggac acttgatctc 14100ctctgatcaa cactgaagag ttacaaaaca ggctgaggaa acaaactttc ttctccctct 14160cccccttctg tccctcccct tccttctcgc tccctccctt gccccctctc tccctgtctc 14220tgtctctgtc tctgtctctg tctctgtctc tgtctctgcc tctcccctcc cctcccctcc 14280ctctgtctct gtctctgtct ctgtctctgt ctctgtctct gtctctgtcc ctttctcctc 14340tatctcctaa atggctggag gccatgctag ctcaatgttg aactttgaac acgtatttag 14400gaaatctttg ttcttaacag ttctgaagtg ctgaagtggt ggtttagtct ctcggcctga 14460caagctcact tcctctcact ctgtcttaat gaccaaatct gccatttccc taaaacagca 14520caggctccag ctccaggttg ctccggagcg gag                              14553

Example 15 - CHO Stable Site 2 Sequences - U.S. Pat. No. 9,816,110

-   <211> 4001
-   <212> DNA
-   <213> Cricetulus griseus
-   <400> 1

(SEQ ID NO:58)

ccaagatgcc catcaactga ttaatagatg ataaaattat tgtacatttc agtgtaatat 60tattcagttt ttaagaaaaa tgaaattatg taataagcat gtaaatggat atatcttgaa 120acaaccattc cccattatat tacctaaaca ttgaaagtcc aaaatcatat gatcttttta 180gtggatctac taatcttttg ctatatgtat tttattgaac tacccatgga tgtgagataa 240ttggtaacaa cagcacatgg gagagcatgg gatcattcaa ggaagattag agagaatgca 300ttttttagga gataatggag gagcaataga aaggattaaa tgaggttact gatgaaagtg 360atggttagag aaggcaatat gaggagggat aactagcact tagggccttt tgaaaaagac 420atagagaaaa tactattgta gaaacttcct ataattggtg tatagttata tacaccaaag 480agctcagatg gagttaccct ataatggaaa tattaactac tttttatcac tgtgataaaa 540catcctgaac agagcaacat agattgggaa gcatttactt tggcttacag ttctaacggg 600ataaaaattc atgatgaaag aatgaatatg tcagcaaaca gcagtagcaa tggcctgaga 660agcaggtgag agctcacatc ttgaagtgta agaatgtagc agagagaaca aactgcaaat 720gaccagaaaa tgcttttgga tcagagccca tacccctctg actgacttct ccagaaattc 780tgaacaaata aaactcccca aacagagcca taactgaagg tccagtgtct gagactacta 840ggggtatttc ttattcaaac cactacaatg gggtgggggg agcaatcctc caagtaggca 900ctacacacag acaaataaaa actctagtaa ctggaatgga ttgacttatt tgaattactt 960gccagtggag ctacatagag cacaattatt gtatttaaat taccctttat gatcttacaa 1020aacttgacag taagatcata ttgctaaaga aaccacatat ttgaatcagg gaacatggtg 1080atatctagtt gttcttcaac tggaaacttc atgctttctg cccagcattc atgttgctgg 1140aaagagcaat gtacactacc agtgtagaaa ttaaatcatc aatcttatca agatgtggat 1200cctataagtt acaataaaaa ttagcctgat aagatatccc caccagaaga atattcacat 1260aaatgctatg ggagcaacaa gctattttct aaattagctt taatcctatt ctacaagaga 1320gaatccatat ctagaatagt tatagggatc aagaacccat ggcttgattg gtcataggcc 1380caatgggaga tcctaatatt attgttctac aaaatgaaaa taactcctaa tgacttgttg 1440ctgcagtaat aagttagtat gttgctcaac tctcacaaga gaagttttgt cttacaataa 1500atggcaatta aagcagcccc acaagattta tatcataccg atctcctcat ggcctatgca 1560tctagaagct aggaaacaaa gaggacccta agagagacat acatggtccc cctggagaag 1620gggaaggggg caagacctcc aaagctaatt gggagcatgg gggaggggag agggagttag 1680aagaaagaga aggggataaa aggagggaga ggaggacaag agagagaagg 
aagatctagt 1740caagagaaga tagaggagag caagaaaaga gataccatag tagagggagc cttgtatgtt 1800taaatagaaa actggcacta gggaattgtc caaagatcca caaggtccaa ctaataatct 1860aagcaatagt cgagaggcta ccttaaaagc ctttctctga taatgagatt gatgactacc 1920ttatatacca tcctagagcc ttcatccagt agctgatgga agcagaagca gacatctaca 1980gctaaacact gagctagttg cagacaggga ggagtgatga gcaaagtcaa gaccaggctg 2040gagaaacaca cagaaacagc agacctgaaa aaaatgttgc acatggaccc cagactgata 2100gctgggagtc cagcatagga cttttctaga aaccctgaat gaggatatca gtttggaggt 2160ctggttaatc tatggggaca ctggtagtgg atcaatattt atccctagtt catgactgga 2220atttgggtac ccattccaca tggaggaatt ctctgtcagc ctagacacat gggggaggtt 2280ctaggtcctg ctccaaataa tgtgttagac tttgaagaac tcccttgaga agactcaccc 2340tccctgggga gcagaaaggg gatgggatga gggttggtga gggacaggag aggaggggag 2400ggtgagggaa ctgggattga caagtaaatg atgcttgttt ctaatttaaa tgaataaagg 2460aaaagtaaaa gaagaaaaga aaacaggcca aaagattata aaagacagag gtggtgggtg 2520actataaaga aacactatta tctaaataaa aacatgtcag aagcacacat gaacttatag 2580tgtttatgaa agtatgtata ataactacat aatctcaagc caagaaaaaa atatcatctt 2640tcagtgatga aggtgatttt atttctccca gaattaaagc caaagaccta atgaaagtaa 2700ttatcttcaa aaggttgaaa atacatactt tgcaatacac agatctgcct agaaatctca 2760tgttcacaat acacatgatg ctcaattgaa ttccattcaa tgttacagtt tagataaaca 2820gtttgtagat aaactcacaa tgtatcattt ctttttattt tttgaccaaa cagcttctca 2880tctgttattc agaataattc ctcgatggca ggatatccat cccaattggg ggaaggggag 2940aatttgaaga aaacctagac cacatacata tttgccattg ggaaacaaag tctaaaatga 3000tgttgttcac atcttctcta ctagtcctct ccccgtccca aagaaccttg gtatatgtgc 3060ctcattttac agagagagga aagcaggaac tgagcatccc ttacttgcca tcctcaaccc 3120aaaatttgca tcattgctca gctctgccct tctcatatga cagttacaag tcaaggcttc 3180caaagtccct ctgtcatgtt tggtgtcaat agtttataca gatgacttca tgtcttcata 3240tctaatgtct tatatagatt aatattaaac aatgttattt ctctaaccac attttaaatt 3300aatttaaaaa tccattaatt gtgtctataa aatgcagaca gagtgctgag acacaatata 3360agcctgatga tctgaatttg aaactcacac ccaccacatg gagaatcaac ttccaaaaat 3420tttcctatta cttccacact 
tacaccattg tacaaacaca ataataatga acaaaatgaa 3480atgaaataaa aaattaagtc tctgtaggta atgctactgt gcagcaaaag taaaaatggc 3540agcttaagct tgctttatgg ttacacttta ccatcttcca ttaattataa ggacttcaat 3600catggcagaa ctatgctgtt attgtctcag tgtaacctaa ccaggtgttc cagatgttct 3660taatgtggac acctaaacta tttgatattt gggttaagat ctttccctct ttcagaagaa 3720acctcaggac agagggaatc ttgtctttta attttgagtc tgtagacttt ttccatttca 3780aatatacatg aaacaagtga tgaagaaaat taatcaaaag gtgggaattg caatgatatt 3840aggttcaata ttaagcttca atattatcat ggaatcgcct gttatacact gagtgtttgg 3900caataaggga tttttagaag aaggagtttt tattctcaac aggttcctta agtttagctc 3960aaataaatct aagcaatcca ctctagaatt aaatagtttc c                     4001
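For readers who wish to check a listing such as SEQ ID NO:58 against its declared length (<211> 4001), the following is a minimal Python sketch and is not part of the original disclosure. It assumes the listing is available as plain text in which bases appear in space-separated groups and each row ends in a running base count, as in the listing above. The function names `parse_listing` and `check_counters` are hypothetical and are chosen for illustration only.

```python
import re


def parse_listing(text: str) -> str:
    """Return the bare sequence from a listing whose rows end in a running base count."""
    # Assumption: every digit run is a position counter and every letter is a base.
    return re.sub(r"[\d\s]+", "", text).lower()


def check_counters(text: str) -> bool:
    """Return True if each running counter equals the number of bases that precede it."""
    seen = 0
    for token in re.findall(r"[A-Za-z]+|\d+", text):
        if token.isdigit():
            if int(token) != seen:
                return False
        else:
            seen += len(token)
    return True


if __name__ == "__main__":
    # Toy input in the same layout; not taken from any SEQ ID NO.
    sample = "acgtacgtac gtacgtacgt 20ttttt 25"
    print(len(parse_listing(sample)), check_counters(sample))
```

Under these assumptions, `len(parse_listing(text))` would be compared with the <211> value for the sequence in question.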

-   <211> 14931
-   <212> DNA
-   <213> Cricetulus griseus
-   <220>
-   <221> misc_feature
-   <222> (2176)..(2239)
-   <223> n is a, c, g, t or nucleotide is missing
-   <400> 4

(SEQ ID NO:59)

catgtacact tatgcaagta tgatatggcc caacacagta ttttacacca atttttatct 60ataaaatata catgtacatc aaaatatatt attaataata acatcattat tctttctttc 120caagtaataa acacatacac tgaaattttg gttcttgtgg ataattttaa tgaaacagga 180aatgcaaatt tatcttagca tgtttacttc actttctttg catagataac cagtaatcac 240attgatggat catgtagtga aatgtatttt taggtatcta aggaattttg gcttcgtttt 300gtgcttgttg acactgaatt ctattcctaa caacagtgtg taaggattct gtctgatttc 360ttttaccagt atttgtccat ttgcattttc tttattattc atggctgctg ttctagaaag 420tggaaggtag tgtgtcaagt ctgtttaaca tgtttccctg atgatcagtg tcttaacacc 480tctctgagta catgttggcc aatgtcgttt ctagacccat ctattcttgc ttgacttatc 540ctggtacatg cctgccaaga aatttctcct catcctttct gtctcttcac tgatttactt 600gatgtgtgga tttcacattg atcatatgga aatagaagat acaattttct ttattcacag 660tttggaagac tttcaatctc atagatcatc attatttttt gctactgttc cctatgctat 720ggtgaaattt ccatttgaat aattgcttaa acaattaaca agaaagaatc tatttttact 780tgcaataact tccatttcag aacatttact acactgttac tatatccaaa aactagtttt 840atatatcatg tgagaaatga ctaattcata atttggccat gacatttttt tcagaaacag 900aaaaagtgac caatacatac acaatgctat aaatattaag acttcagcaa attaaatatt 960tattcatgat atcacataaa attcatttat tatgttttat ttaaatgtgt ttttaaaaca 1020gtggtatcac taaatattaa gttagatgtg tttatgtgct taatgaattt atattttaga 1080atgttataag ttgtatatag tcaaatatgt aataaatttt attttttagg tctttctcat 1140taaggtattt taattttggg tcccttttcc agagtgactc tagctcatga tgagttgaca 1200taaaaactaa acagtacaaa atgtacattg cattcagtat tgcacttgat ctttgcactg 1260aagtttgagt cagttcatac atttagtact tgggaagtac attaagctaa ctttcattgc 1320tctggcaaaa tgctcgataa gataagagtc tattgtggaa agccatggca gcaggaaagt 1380aagactgctg atgatgttta atccatagtc aagacgcaga aggagatgaa tgctggtatc 1440caacattttt tgctgttcat tttctctaga accctagtcc ataaagatgt atgacttgca 1500ttcaaaatgc gtccccttca gttgttcaac ttttctgtaa atatcctttc aggcatgtct 1560agaagattgt ttcgcaaata cttctcaatc cattcaagtt gatagtgcag attaatcact 1620gcagaataaa agcctgtaac ttggctcacg tgccaaggaa tatgcacact cctgacacat 1680caataagtaa atcaaagtgt agcttttgcc tttaacattg ccagacttat 
gtaatgttct 1740gcacgttctt cctccatcac tttttattct aatggtgttt ccttgacatt gaatcacgct 1800gtggaagctg cttagaatta acattgaaat ctactgatat atttatgatg cagcaattta 1860gatttactat tttacttaga attttttata attgagagaa tataatattt tcacagttat 1920ctatctgctg taaatagagg attttaaaaa aaatctctat aacttttttt tacaacacac 1980agtaaaatta agttaaaatt taataaagtc actatgttga tttcaaagtg tgctacgccc 2040acggtggtca cgcaggtgta gcagaagatg ccactaaggt gggctaaggc cgatgggttg 2100gggtctgcgc tccctggaga tgagccccag gcggttccct ggcaatcagc tgcgatcatg 2160atgcccgatg agccannnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220nnnnnnnnnn nnnnnnnnnc tgggtgactt tatggaaaga atttgataga tttcatgatg 2280tagaagaatt ttattaggct tattttacag gagactaaga ccctgggacc taaagatatc 2340tgggtcctga gaatcaggaa atgggtagag acgtggttga tggtatgaga cagattttag 2400agaactctta gatcatgggc aatgaccgca atctgatgct tagaatagat catctataaa 2460caattatgct gttctttttc tttctgttgt atgatctgat gatgtagccc ccttgccaag 2520ttccctgatc ccccttgcca agttccctga ttgtaacagt atataagcat tgcttgagag 2580catattcaac tacattgagt gtgtctgtct gtcatttcct cgccgattcc tgatttctcc 2640ttgagccttt tcccttgttc tccctcggtc ggtggtctcc acgagaggcg gtccgtggca 2700aaagtgtata aatgttctaa aacatttgaa ctctaaaaca tgcaaaatga aaaattaaaa 2760taaataaaca tgaaaattaa aatatattag ctgctaaaag ttaaacaata ctatataata 2820ttttgttatt agaattcaaa atcacattag ttggatttaa tttgaacatt gcattctttc 2880aataataatt tcaataaaaa aagtttcccc atgatagtag aaaataataa catatgtatc 2940tatctattta tttaactaca catatatagc atttgtttca actaaaataa atgaatgagc 3000aaagcaccta agtaattggt gtctattata tttatgaagc caatagtttc aaataaatta 3060tcatgcataa ggaggtattg caaatgttaa accttttttg aaacagatat tcccagttac 3120agaaattata atttctaatc tttcctataa gtagaatgat gataattaat ataggccatt 3180tgtaaataat gttcagatta aaatattctc tatttcacta gagaagaatg atattaaatg 3240tattatattt tatttcccat tttgtttgca ccactattct atatccctca gcagtttaaa 3300tttgtttcac catatgtgtg tgtgtttgta tcttaaatat ggcactaaaa ttagaataat 3360ttaatataaa tctttaggag aaaagatatt gaattatttt atgttgatag gaaaatatct 3420tttaattgtc caagaatact 
ttttcttcta ttttaggact gatcagaccc aggactaata 3480ttttatatgt actaattcta tgtaccaaaa tatgttatta tctcatgaat tctgtctcaa 3540tattgaggta ataaaaatag tccatcatga actttaaaat taaaataatg attaattaat 3600ttttattcat attttgtttg tatgaatggt tatacatcac atgtgtgcct ggtgactgtg 3660aatgtcagga gaaggtatga aagccactgg aattggaata agagataata tttgagatgt 3720tatgtgggtg ctgagaatta gacgcaagcc atcttcaaga atagccagca tactatacca 3780ctgagtaatc cattcatccc tcaataatta tctttgtaga cagtaaatat atttctaaac 3840tataaatgac cagaaaaatt aatgtattat taatgaagac attcatctca tgtgacacac 3900ttcacctgtc taaatcagta acactctctc cactaattaa gattttctaa gtgcatgaca 3960cttactattt ctaaagctgt ccaatggggg ccagtcccca gtcagcaccc agtgagataa 4020tccatgaatg catttatatc ttaggaaaaa ttcttatcta tgtagtattt agaacatttt 4080catgtgaggg gataaacaag gaagcacaga tgctttctga tagaaacttt ctctttaatt 4140catctagaaa aaaaaaacct ctcaggaaaa tctctcttgc tctcctccca atgctctatt 4200cagcatcttc tccctactta attctagatc tttttctcta tgcctccttg ctgctgccct 4260gctggctctg ctctatgcct ccccatgtca cttttctttg ctatctcacc gttaccttct 4320ctgcctcact ctctgccttc ttctctgctt ctcacatggc caggctctgg acaattatag 4380ttatatgtta cattctcata acacatgata tgtcacatag tttctctcag gctagggata 4440tcacaatgac tggccaatga gcaagtggcc ttgcatgtag ctctaagttg gtgatggttc 4500ccagacagta agtagccatt tggttgaaat ttgaggttgg gtagtacatg aagactgaat 4560tttcttcaaa ctctggcctt gaaatagtaa aacaacacct atgaaaatga cgacctgtat 4620ttgtctttag aggcaaccac atattgtctg cagggcctgc tttgaatttg ctctgaagtt 4680agcttgtttg tgtaaaagga agaatcctat atcagcctga gaaatgtaaa atatcctagc 4740atttcaagtc atcaaaatta tatggagagt ataaatcatc cttctgacta ttcatagtca 4800tatttgtgtc caccaagtat aaaacacact accaaagggc tgtggaaaaa atcgccataa 4860ctgttcttat tagggaggca tagcagtggt acctgaggaa gttacagcaa caaccagtca 4920tccagtcaat aaccccatgg ctttgccact tggaggtacc caataatgtt tggctttgcc 4980gagtaggact ccaacaaatt cagagggtca atttttaaat gctggttgtc actgctgaac 5040agtcccattg ccctctgcat aattccacaa tggaaagctt tttacactga ttgccaatca 5100ttaaacagcc tactcagcat aaacaggtat gatattattc tgcattttgt 
tacattacta 5160gatgaattcc tatttcttcc tacaatagtg gaactgaaaa aagatacaca atcatactac 5220ccctctacta atcttatgac ttatatcatt tcaattttca gaccataatg caaactattg 5280accaaaacat gtgaagatga aaaatagaaa tgtagaataa tattacatat aaaaagaaaa 5340ggcggactta ttttgtttta tttcttagca tgcatagcaa tacatgattt gaggtttata 5400taataaaggg acaataaatc ttcaagaaac ttacccctac tgaattaaaa tattaaagaa 5460ggtcacacat ttactcaaat atattagact actgggcaaa tagacatgaa aagtagagtt 5520aatattgagg taggccttct gtgaaatgtc taaggaaatt atgtttcata cagtgtgtaa 5580ccaagtggga atcatatcag aaagcagtca aaagcttata ttacaagtaa cagatgcttg 5640gttatatgac ctcccagagc ttgactgtct atacacaaaa agtggtgtta ataaaactgt 5700aatttgggct atgttttttt aaatggcttc accaacatga aaggaaggga atgagcatgt 5760catggatgct tagagattat gcttccagca agaagaattg agctttggct cttattacag 5820aaacatgaca aggtgtgagt tttatttatt agaaattata taatatttta agctggggac 5880taaaaatttt attgaaacaa acaggcaagg gataggcatg tactagaagc aaaaatagga 5940tgtcaatgct gtaatgttat tttttggacc aaaatagtat ttcctataga aatgacaatg 6000atcttaggtt attattcttc ataaagatga caagttcaca agatatccta gttcattaaa 6060atcgttttag tcatttaata gagtgctgtg atagattaca caaaggaaag cacttacgat 6120gagaaataat gatatccaca attattttct taattcttag aaacattcta ttgttatatc 6180tcaatctcag aagccactta ttgctttatt attgaaacat atgaaattgt aagttatata 6240ttgtctatgg tgacatttca aagaacatgt gacgtacagt gtagcacaga taaagaacat 6300aactgcagct gaatcagtaa ctaaacttac atacattaaa tctgccatgt tggcaacagt 6360gtgtgcacta ccaaaggatg tactaatgct cacgacactc ccctatgtca ccctttgttc 6420atcattacat cataggtcta ttttgtttgc ttttgaaatc tagaccaagt cttttgtgtc 6480tttccaagca cagagctcat taatttacct catagacttg ttaaacttct tctggttcat 6540caattgaata gaaatactca ctactaatta tgtgagaccc tgccagtacc atagcacatg 6600gataattttt acataaaaca tgcatacaag taagattatt cagactgaac atgaatttta 6660gagaaatcag gaaggagtat atgggagtgg ttggagtgag actagagaaa tgtaattaaa 6720ctataatctc aatacaaaga tctactaagc aaaaaacatg aaacattgtc attcaagtga 6780aacatcagtc ttcaaattgg aaagatattt ttactaggaa aatgtctggt agatggttat 6840tatctagaaa acacaaaaat 
tagaaaacgg taaactttaa taaaaagaat aatacaatga 6900gactacatga aaagttctta actaatgaaa caaatatctt gaaacttttt tcttaaaagt 6960ttaatatcaa taaccatcat ggaaattcaa attaaaacta tttacatatt acccctgaaa 7020taataactaa tacccaataa aaataatata aacaaaaaat ggcaatgcat gccatcatgg 7080atttgggaga gagaatgttc attgcagttc tgaatggata ctggtgccac cacggtgaaa 7140atctctgtat aggtccttcc aaaagctgaa aatagacata tcacaagacc tgccacacat 7200ttttcaagca aatacccaaa ggactctacc tgactgcaga gacactttct cataaaatat 7260tattgttgat ctattcataa tatctggaaa atagaaacag ccaagatgcc catcaactga 7320ttaatagatg ataaaattat tgtacatttc agtgtaatat tattcagttt ttaagaaaaa 7380tgaaattatg taataagcat gtaaatggat atatcttgaa acaaccattc cccattatat 7440tacctaaaca ttgaaagtcc aaaatcatat gatcttttta gtggatctac taatcttttg 7500ctatatgtat tttattgaac tacccatgga tgtgagataa ttggtaacaa cagcacatgg 7560gagagcatgg gatcattcaa ggaagattag agagaatgca ttttttagga gataatggag 7620gagcaataga aaggattaaa tgaggttact gatgaaagtg atggttagag aaggcaatat 7680gaggagggat aactagcact tagggccttt tgaaaaagac atagagaaaa tactattgta 7740gaaacttcct ataattggtg tatagttata tacaccaaag agctcagatg gagttaccct 7800ataatggaaa tattaactac tttttatcac tgtgataaaa catcctgaac agagcaacat 7860agattgggaa gcatttactt tggcttacag ttctaacggg ataaaaattc atgatgaaag 7920aatgaatatg tcagcaaaca gcagtagcaa tggcctgaga agcaggtgag agctcacatc 7980ttgaagtgta agaatgtagc agagagaaca aactgcaaat gaccagaaaa tgcttttgga 8040tcagagccca tacccctctg actgacttct ccagaaattc tgaacaaata aaactcccca 8100aacagagcca taactgaagg tccagtgtct gagactacta ggggtatttc ttattcaaac 8160cactacaatg gggtgggggg agcaatcctc caagtaggca ctacacacag acaaataaaa 8220actctagtaa ctggaatgga ttgacttatt tgaattactt gccagtggag ctacatagag 8280cacaattatt gtatttaaat taccctttat gatcttacaa aacttgacag taagatcata 8340ttgctaaaga aaccacatat ttgaatcagg gaacatggtg atatctagtt gttcttcaac 8400tggaaacttc atgctttctg cccagcattc atgttgctgg aaagagcaat gtacactacc 8460agtgtagaaa ttaaatcatc aatcttatca agatgtggat cctataagtt acaataaaaa 8520ttagcctgat aagatatccc caccagaaga atattcacat aaatgctatg 
ggagcaacaa 8580gctattttct aaattagctt taatcctatt ctacaagaga gaatccatat ctagaatagt 8640tatagggatc aagaacccat ggcttgattg gtcataggcc caatgggaga tcctaatatt 8700attgttctac aaaatgaaaa taactcctaa tgacttgttg ctgcagtaat aagttagtat 8760gttgctcaac tctcacaaga gaagttttgt cttacaataa atggcaatta aagcagcccc 8820acaagattta tatcataccg atctcctcat ggcctatgca tctagaagct aggaaacaaa 8880gaggacccta agagagacat acatggtccc cctggagaag gggaaggggg caagacctcc 8940aaagctaatt gggagcatgg gggaggggag agggagttag aagaaagaga aggggataaa 9000aggagggaga ggaggacaag agagagaagg aagatctagt caagagaaga tagaggagag 9060caagaaaaga gataccatag tagagggagc cttgtatgtt taaatagaaa actggcacta 9120gggaattgtc caaagatcca caaggtccaa ctaataatct aagcaatagt cgagaggcta 9180ccttaaaagc ctttctctga taatgagatt gatgactacc ttatatacca tcctagagcc 9240ttcatccagt agctgatgga agcagaagca gacatctaca gctaaacact gagctagttg 9300cagacaggga ggagtgatga gcaaagtcaa gaccaggctg gagaaacaca cagaaacagc 9360agacctgaaa aaaatgttgc acatggaccc cagactgata gctgggagtc cagcatagga 9420cttttctaga aaccctgaat gaggatatca gtttggaggt ctggttaatc tatggggaca 9480ctggtagtgg atcaatattt atccctagtt catgactgga atttgggtac ccattccaca 9540tggaggaatt ctctgtcagc ctagacacat gggggaggtt ctaggtcctg ctccaaataa 9600tgtgttagac tttgaagaac tcccttgaga agactcaccc tccctgggga gcagaaaggg 9660gatgggatga gggttggtga gggacaggag aggaggggag ggtgagggaa ctgggattga 9720caagtaaatg atgcttgttt ctaatttaaa tgaataaagg aaaagtaaaa gaagaaaaga 9780aaacaggcca aaagattata aaagacagag gtggtgggtg actataaaga aacactatta 9840tctaaataaa aatatgtcag aagcacacat gaacttatag tgtttatgaa agtatgtata 9900ataactacat aatctcaagc caagaaaaaa atatcatctt tcagtgatga aggtgatttt 9960atttctccca gaattaaagc caaagaccta atgaaagtaa ttatcttcaa aaggttgaaa 10020atacatactt tgcaatacac agatctgcct agaaatctca tgttcacaat acacatgatg 10080ctcaattgaa ttccattcaa tgttacagtt tagataaaca gtttgtagat aaactcacaa 10140tgtatcattt ctttttattt tttgaccaaa cagcttctca tctgttattc agaataattc 10200ctcgatggca ggatatccat cccaattggg ggaaggggag aatttgaaga aaacctagac 10260cacatacata 
tttgccattg ggaaacaaag tctaaaatga tgttgttcac atcttctcta 10320ctagtcctct ccccgtccca aagaaccttg gtatatgtgc ctcattttac agagagagga 10380aagcaggaac tgagcatccc ttacttgcca tcctcaaccc aaaatttgca tcattgctca 10440gctctgccct tctcatatga cagttacaag tcaaggcttc caaagtccct ctgtcatgtt 10500tggtgtcaat agtttataca gatgacttca tgtcttcata tctaatgtct tatatagatt 10560aatattaaac aatgttattt ctctaaccac attttaaatt aatttaaaaa tccattaatt 10620gtgtctataa aatgcagaca gagtgctgag acacaatata agcctgatga tctgaatttg 10680aaactcacac ccaccacatg gagaatcaac ttccaaaaat tttcctatta cttccacact 10740tacaccattg tacaaacaca ataataatga acaaaatgaa atgaaataaa aaattaagtc 10800tctgtaggta atgctactgt gcagcaaaag taaaaatggc agcttaagct tgctttatgg 10860ttacacttta ccatcttcca ttaattataa ggacttcaat catggcagaa ctatgctgtt 10920attgtctcag tgtaacctaa ccaggtgttc cagatgttct taatgtggac acctaaacta 10980tttgatattt gggttaagat ctttccctct ttcagaagaa acctcaggac agagggaatc 11040ttgtctttta attttgagtc tgtagacttt ttccatttca aatatacatg aaacaagtga 11100tgaagaaaat taatcaaaag gtgggaattg caatgatatt aggttcaata ttaagcttca 11160atattatcat ggaatcgcct gttatacact gagtgtttgg caataaggga tttttagaag 11220aaggagtttt tattctcaac aggttcctta agtttagctc aaataaatct aagcaatcca 11280ctctagaatt aaatagtttc ctaagggcac agctatgaat agagctcaat ttacatataa 11340aattttgttc accatttatg tcattccagt tttcattagt acaaggaaaa tacaaaatat 11400ttagatgtca atatcaagtg aatagttcat ctcctttttt aatatatatc acctaaatca 11460ccattttctc agaaaaatct ggcctgaagt tctgtctgga acttcaacat gaaaaatatg 11520cacagcttgc tattataaat cctagttgat ttttaagatt catgtctggt gtctgactca 11580gaggggccag aggctagaca aatatttttt gaatcttcat tgtgaagatt tttaatgatt 11640attttaatat aaataacaaa gatgatggat aatgtaactt tgtacagttc atagacgctg 11700aactactttg tgcttaaaat gttagttccc tatcataaat gataggtgat aagtgtatgt 11760ttaatacttt ccctctgagc tatattcatg tactagagaa ttattttaaa catgaaaaga 11820ctgtgtttat agtctcagct cctgagaact ggtccaacct taggcaggtg aatgccagga 11880gcaacgtttt tcttctacag aggatgcttt gctgccaagc aacctggttg tgtggaaatg 11940ttcctttttt aatcaagttt 
aaagggtctt catcatgctg ttgctccaca tattttcagg 12000ttagagcttg gtccttggag tattatcttt taccagaaaa ttcatagtat tctttcaata 12060actaacaact aaacttttcg ataaaaaaga attggaattt caattttaaa gcctgagtaa 12120aattcttgtg aatcaggata ttttatttta agtcttatct tttaaaaagt tattttattt 12180tttaaaaaat tataatatac tttcataatt tccctccttc acttttcttt acaaacactt 12240ctatagatca ccatgtgttt ttttttttac atttatggcc tctttctgtt cattgttatt 12300acatacaaat agtcttgcct atagaagaac accacaattt gttacctgat aacaaattat 12360caacccttaa aacctacaaa ctattgatat tactgaaaag actatactta tagatgtaaa 12420gatatatgtg tgtgcacata tatagataca catatatgta ggatttttaa ttttagattt 12480tagacatcaa aattatttat atgactgaga aactagacac tataaatgag cattcagtat 12540tcaacaccgt gattttagat attgtcacaa tgacagaaaa ttttcttata gaaaatttta 12600agttttgtga ttgctctgtg cacttagtga agtctcacag aaaaagaatc atagtatttt 12660tagtttataa taaaaagtac atataattaa aatggttggc acaaaacaac atttgagcat 12720ttttcctatt tactatcaag tagtatcatt ttgaaataat aatttgacta gtttcaaaaa 12780tgaaaacaaa atttaaacta aatgcctaat ctagcctgat aacattttta tgaatgaaat 12840tattcaatag tgttatcaat taggggccca aaacttttcc taaaataaaa cttttaattt 12900ttttccattt ttatttaaat tagaaacaaa attgttttac atgtaaatca gagtttcctc 12960accctcccct tctccctgtc cctcactaac accctacttg tcccatacca tttctgctcc 13020ccagggaggg tgaggccttc catggggaaa cttcagagtc tgtctatcct ttcggatagg 13080gcctaggccc tcacccattt gtctaggcta aggctcacaa agtttactcc tatgctagtg 13140ataagtactg atctactaca agagacacca tagatttcct aggcttcctc actgacaccc 13200atgttcatgg ggtctggaac aatcatatgc tagtttccta ggtatcagtc tggggaccat 13260gagctccccc ttgttcaggt caactgtttc tgtgggtttc accaccctgg tcttgactgc 13320tttgctcatc actcctccct ttctgtaact gggttccagt acaattccgt gtttagctgt 13380gggtgtctac ttctactttc atcagcttct gggatggagc ctctaggata gcatacaatt 13440agtcatcatc tcattatcag ggaagggcat ttaaagtagc ctctccattg ttgcttggat 13500tgttagttgg tgtcatcttt gtagatctct ggacatttcc ctagtgccag atatctcttt 13560aaacctacaa gactacctct attatggtat ctcttttctt gctctcgtct attcttccag 13620acaaaatctt cctgctccct tatattttcc 
tctcccctcc tcttctcccc ttctcattct 13680cctagatcca tcttcccttc ccccatgctc ccaagagaga tgttgctcag gagatcttgt 13740tccttaaccc ttttcttggg gatctgtctc tcttagggtt gtccttgttt cctagcttct 13800ctggaagtgt ggattgtaag ctggtaatca tttgctccat gtctaaaatc catatatgag 13860tgatgtttgt ctttttgtga ctgggttacc tcactcaaaa tggtttcttc catatgtctg 13920tggatttcaa tagcacaaac aacatacagt atcttggggc aacactaacc aaacaagtga 13980aagaccagta tagcaagaac tttgagttta aagaaagaaa ttaaagaaga taccagaaaa 14040tggaaagatc tcccatgctc tttgataggc agaatcaaca tagtaaaaat ggcaatcttg 14100ccaaaatcca tctacagact caatgcaatc cccattaaat accagcacac ttcttcacag 14160acctgaaaga ataatactta actttatatg gagaaacaaa agacccagga taggccaaac 14220aaccctgtac aatgaaggca cttccagagg catccccatc cctgacttca agctctatta 14280tagagtaata atcctgaaaa cagcttggta atggcacaaa aatagacagg tagaccaatg 14340gaattgagtt gaaaaccctg atattaaccc acatatctat gaacacctga ctttgacaaa 14400gaagctaagg ttatacaatg taagaaagaa agcatcttca acaaatcgtg ctggcataac 14460tggatgctgg catgtagaag actgcagata gatccatgtc taatgccatg cacaaaactt 14520aagtccaaat ggatcaaaaa cctcaacata aatccagcca cactgaacct catagaagag 14580aaagtgggaa gtatccttga ataaattggt acaggagacc acatcttgaa cttaacacca 14640gtagcacaga caatcagatc aataatcaat aaatgggacc tcctgaaact gagaagcttc 14700tgtaaggcaa tggataagtc aacaggacaa aatggcagcc cacggaatgg gaaaagatat 14760tcaccaatcc tatatctgac agagggctgc tctctatttg caaagaacac aataagctag 14820tttttaaaac accaattaat ccgattataa agttgggtag agaactaaat aaagaattgt 14880taacagagca atctaacttg gcagaaagac acataagaaa gtgctcacca t          14931
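The misc_feature annotation for SEQ ID NO:59, <222> (2176)..(2239), gives 1-based inclusive coordinates for the run of n residues. The following minimal Python sketch, which is not part of the original disclosure, shows one way such runs could be located in a bare sequence string. The function name `ambiguous_runs` is hypothetical, and the input is assumed to have had position counters and whitespace already removed.

```python
import re


def ambiguous_runs(seq: str, symbol: str = "n"):
    """Return 1-based inclusive (start, end) coordinates of each run of `symbol`."""
    pattern = re.escape(symbol.lower()) + "+"
    # match.start() is 0-based, so add 1; match.end() is exclusive, so it already
    # equals the 1-based inclusive end position.
    return [(m.start() + 1, m.end()) for m in re.finditer(pattern, seq.lower())]


if __name__ == "__main__":
    # Toy input; not taken from any SEQ ID NO.
    print(ambiguous_runs("acgt" + "n" * 5 + "ac"))
```

Applied to the bare sequence of SEQ ID NO:59, the returned coordinates would be compared with the <222> range.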

It is to be understood that the description, specific examples and data, while indicating exemplary embodiments, are given by way of illustration and are not intended to limit the present inventions. Various changes and modifications within the present invention, including combining embodiments in whole and in part, will become apparent to the skilled artisan from the discussion, disclosure and data contained herein, and thus are considered part of the inventions.

1-28. (canceled)
29. A eukaryotic cell comprising a polynucleotide comprising (i) a promoter, (ii) an intron, (iii) a first internal ribosome entry site, (iv) a first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, (vi) a second polynucleotide encoding AAV Cap protein, and (vii) a polyadenylation site.
30. The cell according to claim 29, wherein the (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein are operably linked.
31. The cell according to claim 30, wherein the polynucleotide is integrated into a cell genome.
32-36. (canceled)
37. A eukaryotic cell comprising (A) a first polynucleotide comprising (i) a promoter, (ii) an intron, (iii) an internal ribosome entry site, (iv) a polynucleotide encoding AAV Cap protein, and (v) a polyadenylation site; and (B) a second polynucleotide comprising (i) a promoter, (ii) an intron, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site.
38. The cell according to claim 37, wherein the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (A) first polynucleotide are operably linked, and wherein the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein of (B) second polynucleotide are operably linked.
39-44. (canceled)
45. A eukaryotic cell comprising (A) a first polynucleotide comprising (i) a promoter, (ii) an intron, (iii) an internal ribosome entry site, (iv) a polynucleotide encoding AAV Cap protein, and (v) a polyadenylation site; and (B) a second polynucleotide comprising (i) a promoter, (ii) an internal ribosome entry site, (iii) a polynucleotide encoding AAV Cap protein, and (iv) a polyadenylation site.
46. The cell according to claim 45, wherein the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (A) first polynucleotide are operably linked, and wherein the (i) promoter, (ii) internal ribosome entry site, and (iii) polynucleotide encoding AAV Cap protein of (B) second polynucleotide are operably linked.
47. The cell according to claim 46, wherein at least one polynucleotide is integrated into a cell genome.
48. The cell according to claim 45, wherein the cell is a CHO cell or a BHK cell.
49. The cell according to claim 45, wherein the cell is a HEK 293 cell or a human amniotic cell.
50-55. (canceled)
56. A method of producing adeno-associated virus (AAV) Cap protein in cell culture, wherein the method comprises the steps of: providing eukaryotic cells according to claim 29; and culturing the cells in a culture medium to allow the cells to produce AAV Cap protein.
57. The method according to claim 56, wherein the (i) promoter, (ii) intron, (iii) first internal ribosome entry site, (iv) first polynucleotide encoding AAV Cap protein, (v) a second internal ribosome entry site, and (vi) second polynucleotide encoding AAV Cap protein are operably linked.
58. The method according to claim 57, wherein the polynucleotide is integrated into a cell genome.
59. The method according to claim 56, wherein the cell is a CHO cell or a BHK cell.
60. The method according to claim 56, wherein the cell is a HEK 293 cell or a human amniotic cell.
61-63. (canceled)
64. A method of producing adeno-associated virus (AAV) Cap protein in cell culture, wherein the method comprises the steps of providing eukaryotic cells according to claim 37; and culturing the cells in a culture medium to allow the cells to produce AAV Cap protein, wherein the polynucleotide allows for production of AAV Cap proteins VP1, VP2 and VP3.
65. The method according to claim 64, wherein the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (a) first polynucleotide are operably linked, and wherein the (i) promoter, (ii) intron, and (iii) polynucleotide encoding AAV Cap protein of (b) second polynucleotide are operably linked.
66-71. (canceled)
72. A method of producing adeno-associated virus (AAV) Cap protein in cell culture, wherein the method comprises the steps of providing eukaryotic cells according to claim 45; and culturing the cells in a culture medium to allow the cells to produce AAV Cap protein, wherein the polynucleotide allows for production of AAV Cap proteins VP1, VP2 and VP3.
73. The method according to claim 72, wherein the (i) promoter, (ii) intron, (iii) internal ribosome entry site and (iv) polynucleotide encoding AAV Cap protein of (a) first polynucleotide are operably linked, and wherein the (i) promoter, (ii) internal ribosome entry site, and (iii) polynucleotide encoding AAV Cap protein of (b) second polynucleotide are operably linked.
74. The method according to claim 72, wherein the polynucleotide is integrated into a cell genome.
75-81. (canceled)